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PREFACE 


PREFAGE- TO THE EIRST EDTTION 


Every aspect of this book was influenced by the desire to present calculus not 
merely as a prelude to but as the first real encounter with mathematics. Since 
the foundations of analysis provided the arena in which modern modes of math- 
ematical thinkmg developed, calculus ought to be the place in which to expect, 
rather than avoid, the strengthening of insight with logic. In addition to devel- 
oping the students’ intuition about the beautiful concepts of analysis, it is surely 
equally mportant to persuade them that precision and rigor are neither deterrents 
to mtuition, nor ends in themselves, but the natural medium in which to formulate 
and think about mathematical questions. 


This goal imphes a view of mathematics which, in a sense, the entire book 
attempts to defend. No matter how well particular topics may be developed, the 
goals of this book will be realized only if it succeeds as a whole. For this reason, it 
would be of little value merely to list the topics covered, or to mention pedagogical 
practices and other mnovations. Even the cursory glance customarily bestowed on 
new calculus texts will probably tell more than any such extended advertisement, 
and teachers with strong feelngs about particular aspects of calculus will know just 
where to look to see if this book fulfills ther requirements. 

A few features do require explicit comment, however. Of the twenty-nine chap- 
ters in the book, two (starred) chapters are optional, and the three chapters com- 
prismg Part V have been included only for the benefit of those students who might 
want to examine on their own a construction of the real numbers. Moreover, the 
appendices to Chapters 3 and 11 also contain optional material. 

The order of the remaining chapters is intentionally quite inflexible, since the 
purpose of the book is to present calculus as the evolution of one idea, not as a 
collection of “topics.” Since the most exciting concepts of calculus do not appear 
until Part IIE, it should be pointed out that Parts I and IT will probably require 
less ume than their length suggests —although the entire book covers a one-year 
course, the chapters are not meant to be covered at any uniform rate. A rather 
natural dividing pomt does occur between Parts IT and III, so it is possible to 
reach differentiation and integration even more quickly by treating Part II very 
briefly, perhaps returning later for a more detailed treatment. This arrangement 
corresponds to the traditional organization of most calculus courses, but I feel 
that it will only diminish the value of the book for students who have seen a 
small amount of calculus previously, and for bright students with a reasonable 
background. 

The problems have been designed with this particular audience in mind. ‘They 
range from straightforward, but not overly simple, exercises which develop basic 
techniques and test understanding of concepts, to problems of considerable difli- 
culty and, I hope, of comparable interest. “Vhere are about 625 problems in all. 
Those which emphasize manipulations usually contain many examples, numbered 
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with small Roman numerals, while small letters are used to label interrelated parts 
in other problems. Some indication of relative difficulty is provided by a system of 
starrmg and double starring, but there are so many criteria for judging difficulty, 
and so many hints have been provided, especially for harder problems, that this 
guide is not completely reliable. Many problems are so difficult, especially if the 
hints are not consulted, that the best of students will probably have to attempt only 
those which especially interest them; from the less difficult problems it should be 
easy to select a portion which will keep a good class busy, but not frustrated. ‘The 
answer section contains solutions to about half the examples from an assortment 
of problems that should provided a good test of technical competence. A separate 
answer book contains the solutions of the other parts of these problems, and of all 
the other problems as well. Finally, there is a Suggested Reading list, to which the 
problems often refer, and a glossary of symbols. 

I am grateful for the opportunity to mention the many people to whom I owe my 
thanks. Jane Bjorkgren performed prodigious feats of typing that compensated for 
my fitful production of the manuscript. Richard Serkey helped collect the material 
which provides historical sidelights in the problems, and Richard Weiss supplied 
the answers appearing in the back of the book. I am especially grateful to my 
friends Michael Freeman, Jay Goldman, Anthony Phillips, and Robert Wells for 
the care with which they read, and the relentlessness with which they criticized, a 
preliminary version of the book. Needles to say, they are not responsible for the 
deficiencies which remain, especially smce I sometimes rejected suggestions which 
would have made the book appear suitable for a larger group of students. I must 
express my admiration for the editors and staff of W. A. Benjamin, Inc., who were 
always eager to increase the appeal of the book, while recognizing the audience 
for which it was intended. 

The inadequacies which prelimmary editions always involve were gallantly en- 
dured by a rugged group of freshmen in the honors mathematics course at Brandeis 
University during the academic year 1965-1966. About half of this course was 
devoted to algebra and topology, while the other half covered calculus, with the 
preliminary edition as the text. It is almost obligatory in such circumstances to 
report that the preliminary version was a gratifying success. ‘This is always safe— 
after all, the class is unlikely to rise up in a body and protest publicly—but the 
students themselves, it seems to me, deserve the right to assign credit for the thor- 
oughness with which they absorbed an impressive amount of mathematics. I am 
content to hope that some other students will be able to use the book to such good 
purpose, and with such enthusiasm. 


Haltham, Massachusetts MICHAEL SPIVAK 
lebruary 1967 


PREFACE TO THE SECOND EDITION 


[have often been told that the title of this book should really be something like “An 
Introduction to Analysis,” because the book is usually used in courses where the 
students have already learned the mechanical aspects of calculus—such courses are 
standard in Europe, and they are becoming more common in the United States. 
After thirteen years it seems too late to change the title, but other changes, in 
addition to the correction of numerous misprints and mistakes, seemed called for. 
There are now separate Appendices for many topics that were previously slighted: 
polar coordinates, uniform continuity, parameterized curves, Riemann sums, and 
the use of integrals for evaluating lengths, volumes and surface areas. A few topics, 
like manipulations with power series, have been discussed more thoroughly in the 
text, and there are also more problems on these topics, while other topics, like 
Newton’s method and the trapezoid rule and Simpson’s rule, have been developed 
in the problems. There are in all about 160 new problems, many of which are 
intermediate in difficulty between the few routine problems at the beginning of 
each chapter and the more difficult ones that occur later. 

Most of the new problems are the work of ‘Ted Shifrin. Frederick Gordon 
pointed out several serious mistakes in the original problems, and supplied some 
non-trivial corrections, as well as the neat proof of ‘Theorem 12-2, which took 
two Lemmas and two pages in the first edition. Joseph Lipman also told me 
of this proof, together with the similar trick for the proof of the last theorem in 
the Appendix to Chapter 11, which went unproved in the first edition. Roy O. 
Davies told me the trick for Problem 11-66, which previously was proved only in 
Problem 20-8 [21-8 in the third edition], and Marina Ratner suggested several 
interesting problems, especially ones on uniform continuity and infinite series. ‘To 
all these people go my thanks, and the hope that in the process of fashioning the 
new edition their contributions weren't too badly botched. 


MICHAEL SPIVAK 


PREFACE: FOE THIRD:E DITION 


The most significant change in this third edition ts the inclusion of a new (starred) 
Chapter 17 on planetary motion, in which calculus is employed for a substantial 
physics problem. 

In preparation for this, the old Appendix to Chapter 4 has been replaced by 
three Appendices: the first two cover vectors and conic sections, while polar coor- 
dinates are now deferred until the third Appendix, which also discusses the polar 
coordinate equations of the conic sections. Moreover, the Appendix to Chapter 12 
has been extended to treat vector operations on vector-valued curves. 

Another large change is merely a rearrangement of old material: “Phe Cos- 
mopolitan Integral,” previously a second Appendix to Chapter 13, ts now an 
Appendix to the chapter on “Integration in Elementary ‘Terms” (previously Chap- 
ter 18, now Chapter 19); moreover, those problems from that chapter which used 
the material from that Appendix now appear as problems in the newly placed 
Appendix. 

A few other changes and renumbering of Problems result from corrections, and 
elimination of incorrect problems. 


IT was both startled and somewhat dismayed when I realized that after allow- 
ing 13 years to elapse between the first and second editions of the book, | have 
allowed another 14 years to clapse before this third edition. During this time I 
seem to have accumulated a not-so-short list of corrections, but no longer have 
the original communications, and therefore cannot properly thank the various m- 
dividuals involved (who by now have probably lost interest anyway). I have had 
time to make only a few changes to the Suggested Reading, which after all these 
years probably requires a complete revision; this will have to wait untl the next 
edition, which I hope to make in a more timely fashion. 


MICHAEL SPIVAK 


PREFACE. FO THE FOURTH EDITION 


Promises, promises! In the preface to the third edition I noted that it was 13 years 
between the first and second editions, and then another 14 years before the third, 
expressing the hope that the next edition would appear sooner. Well, here it is 
another 14 years later before the fourth, and presumably final, edition. 

Although small changes have been made to some material, especially m Chap- 
ters 5 and 20, this edition diflers mainly in the introduction of additional problems, 
a complete update of the Suggested Readmg, and the correction of numerous er- 
rors. ‘These have been brought to my attention over the years by, among others, 
Nils von Barth; Philip Loewen; Fernando Mejias; Lance Miller, who provided a 
long list, particularly for the answer book; and Michael Maltenfort, who provided 
an amazingly extensive list of misprints, errors, and criticisms. 

Most of all, however, [ am indebted to my friend Ted Shifrm, who has been 
usmg the book for the text m his renowned course at the University of Georgia 
for all these years, and who prodded and helped me to finally make this needed 
revision. | must also thank the students in his course this last academic year, who 
served as guinea pigs for the new edition, resulting, m particular, m the current 
proof im Problem 8-20 for the Rismg Sun Lemma, far simpler than Reisz’s original 
proof, or even the proof in [38] of the Suggested Reading, which itself has now 
been updated considerably, agam with great help from ‘Ted. 


MICHAEL SPIVAK 
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PROLOGUE 


CHAPTER 


BASIC PROPERTIES OF NUMBERS 


The title of this chapter expresses in a few words the mathematical knowledge 
required to read this book. In fact, this short chapter is simply an explanation of 
what is meant by the “basic properties of numbers,” all of which—addition and 
multiplication, subtraction and division, solutions of equations and inequalities, 
factoring and other algebraic manipulations—are already familiar to us. Never- 
theless, this chapter is not a review. Despite the familiarity of the subject, the 
survey we are about to undertake will probably seem quite novel; it does not aim 
to present an extended review of old material, but to condense this knowledge 
into a few simple and obvious properties of numbers. Some may even seem too 
obvious to mention, but a surprising number of diverse and important facts turn 
out to be consequences of the ones we shall emphasize. 

Of the twelve properties which we shall study in this chapter, the first nine are 
concerned with the fundamental operations of addition and multiplication. For 
the moment we consider only addition: this operation is performed on a pair 
of numbers—the sum a + Db exists for any two given numbers a and b (which 
may possibly be the same number, of course). It might seem reasonable to regard 
addition as an operation which can be performed on several numbers at once, and 
consider the sum a; + --- +a, of m numbers aj,...,@, as a basic concept. It is 
more convenient, however, to consider addition of pairs of numbers only, and to 
define other sums in terms of sums of this type. For the sum of three numbers 
a, b, and c, this may be done in two different ways. One can first add b and c, 
obtaining b +c, and then add a to this number, obtainmg a + (b +c); or one can 
first add a and b, and then add the sum a + b to c, obtaining (a+b) +c. Of 
course, the two compound sums obtained are equal, and this fact is the very first 
property we shall list: 


(Pl) If a,b, and c are any numbers, then 


at+(b+c)=(a+b)+c. 


The statement of this property clearly renders a separate concept of the sum of 
three numbers superfluous; we simply agree that a + b + c denotes the number 
a+(b+c) = (a+b)+c. Addition of four numbers requires similar, though slightly 
more involved, considerations. The symbol a + b + c +d is defined to mean 


l) (4@48)2,¢)+ d, 
(a+(b+c))+d, 


) 
) 
) at+((b+c)+d), 
) 
) 


N 


or 


os) 


Or 
4 
5) 


a--O--(e --¢@)), 
(a+b)+(c+d). 


or 
or 


a ea 
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This definition is unambiguous since these numbers are all equal. Fortunately, his 
fact need not be listed separately, since it follows from the property P1 already 
listed. For example, we know from P1 that 


(a+b)+c=a+(b+c), 
and it follows immediately that (1) and (2) are equal. The equality of (2) and (3) 


is a direct consequence of P], although this may not be apparent at first sight 
(one must let b + c play the role of b in PI, and d the role of c). The equalities 
(3) = (4) = (5) are also simple to prove. 

It is probably obvious that an appeal to P1 will also suffice to prove the equality 
of the 14 possible ways of summing five numbers, but it may not be so clear how we 
can reasonably arrange a proof that this is so without actually listing these 14 sums. 
Such a procedure is feasible, but would soon cease to be if we considered collections 
of six, seven, or more numbers; it would be totally inadequate to prove the equality 
of all possible sums of an arbitrary finite collection of numbers aj,...,@n. This 
fact may be taken for granted, but for those who would like to worry about the 
proof (and it is worth worrying about once) a reasonable approach is outlined in 
Problem 24. Henceforth, we shall usually make a tacit appeal to the results of this 
problem and write sums aj + +--+ a, with a blithe disregard for the arrangement 
of parentheses. 

The number 0 has one property so important that we list it next: 


(P2) If a is any number, then 


a+0=0+a=a. 


An important role is also played by 0 in the third property of our list: 
(P3) For every number a, there is a number —a such that 


a+(-a) = (-a)+a = 0. 


Property P2 ought to represent a distinguishing characteristic of the number 0, 
and it is comforting to note that we are already in a position to prove this. Indeed, 
ifa number x satisfies 

a+x=a 


for any one number a, then x = 0 (and consequently this equation also holds for all 
numbers a). The proof of this assertion involves nothing more than subtracting a 
from both sides of the equation, in other words, adding —a to both sides: as the 
following detailed proof shows, all three properties P]~ P3 must be used to justify 
this operation. 


ie atx=a, 
then (—a) + (a+x)=(-a) +a =0; 
hence ((—a)+a)+x«=0; 
hence O++x = 0; 


hence yo e=i)), 
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As we have just hinted, it is convenient to regard subtraction as an operation 
derived from addition: we consider a — b to be an abbreviation for a + (—b). It 
is then possible to find the solution of certain simple equations by a series of steps 
(each justified by PI, P2, or P3) similar to the ones just presented for the equation 
G+x =q. Torexampc: 


If sao By 
then (x + 3)4+ (-—3) =5+4+ (3); 
hence x+ (34 (—3))=5-3=2; 
hence Seren Op 8 
hence ee 


Naturally, such elaborate solutions are of interest only until you become convinced 
that they can always be supplied. In practice, it is usually just a waste of time to 
solve an equation by indicating so exphcitly the reliance on properties P1, P2, and 
P3 (or any of the further properties we shall list). 

Only one other property of addition remains to be listed. When considering the 
sums of three numbers a, b, and c, only two sums were mentioned: (a + b) +¢ 
and a+ (b+ c). Actually, several other arrangements are obtained if the order of 
a, b, and ¢ is changed. ‘That these sums are all equal depends on 


(P4) If a and b are any numbers, then 


feb = Db a, 


The statement of P4 is meant to emphasize that although the operation of ad- 
dition of pairs of numbers might conceivably depend on the order of the two 
numbers, in fact it does not. It is helpful to remember that not all operations are 
so well behaved. For example, subtraction does not have this property: usually 
a—b#b-—a. In passing we might ask just when a — b does equal b — a, and it 
is amusing to discover how powerless we are if we rely only on properties P1—P4 
to justify our manipulations. Algebra of the most elementary variety shows that 
a—b=b-—a only whena = b. Nevertheless, it is impossible to derive this fact 
from properties P1~ P4; it is instructive to examine the elementary algebra care- 
fully and determime which step(s) cannot be justified by P] -P4. We will indeed 
be able to justify all steps in detail when a few more properties are listed. Oddly 
enough, however, the crucial property involves multiplication. 

The basic properties of multiplication are fortunately so similar to those for ad- 
dition that litte comment will be needed; both the meaning and the consequences 
should be clear. (As in elementary algebra, the product of a and b will be denoted 
by a:b, or simply ab.) 


(P5) If @, b, and c are any numbers, then 


@-(b -c)—(a-p)- co 


(P6) If @ is any number, then 
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Moreover] 40. 


(The assertion that | 4 0 may seem a strange fact to list, but we have to 
list it, because there is no way it could possibly be proved on the basis of the 
other properties listed these properties would all hold if there were only one 
number, namely, 0.) 


(P7) For every number a ¥ 0, there is a number a~! such that 


qet ee) GS 


(P8) If a and b are any numbers, then 


GS ae 


One detail which deserves emphasis 1s the appearance of the condition a 4 0 
in P7. This condition is quite necessary; since 0-b = 0 for all numbers b, there is no 
number 07! satisfying 0-07! = 1. This restriction has an important consequence 
for division. Just as subtraction was defined in terms of addition, so division is 
defined in terms of multiplication: The symbol a/b means a- b~!. Since 07! is 
meaningless, a/0 is also meaningless—division by 0 is always undefined. 

Property P7 has two important consequences. If a+b = a-c, it does not 
necessarily follow that b = c; for if a = 0, then both a-b and a-c are 0, no matter 
what b and c are. However, if a £0, then b = ¢; this can be deduced from P7 as 
follows: 


If a-b 
then ended) 
hence (cia -a)-b 
hence l-b 
hence p=@ 


a-cand g £0: 


ie (a-c); 


(a! a1Gi)e C8 
| Feares 


It is also a consequence of P7 that if a-b = 0, then either a = 0 or b = 0. In fact, 


ii a-b=0 amdig= U, 

then a !-(a-b) =0: 

hence (a7!-a)-b=0: 

hence i 5 "0; 

hence boo 
(It may happen that a = 0 and b = 0 are both true; this possibility is not excluded 
when we say “cither @ = 0 or b = 0°; m mathematics “or” is always used m the 
sense of “one or the other, or both.”) 

This latter consequence of P7 is constantly used m the solution of equations. 

Suppose, for example, that a number x is known to satisfy 


(~ — I(x — 2) = O. 


Then it follows that either x — | = 0 or x —2 = 0: hence x = | or x = 2. 
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On the basis of the eight properties listed so far it is still possible to prove very 
little. Listing the next property, which combines the operations of addition and 
multipheation, will alter this situation drastically. 


(P9) If a, b, and c¢ are any numbers, then 
a:‘(b+c)=a-bt+a-c. 
(Notice that the equation (b+ c)-a=b-a+c-a 1s also true, by P8.) 


As an example of the usefulness of P9 we will now determine just when a — b = 


b-a: 


If a-b=b—-a., 

then (a—b)+b=(b-—a)+b=b4+ (b—a): 
hence a=b+b-a; 

hence ata=(b+b—a)+az=b+b. 
Consequently a-(l+l)=b-(1+1), 

and therefore a 


A second use of P9 is the justification of the asseruon a- 0 = 0 which we have 
already made, and even used m a proof on page 6 (can you find where?). This 
fact was not listed as one of the basic properties, even though no proof was offered 
when it was first mentioned. With Pl P8 alone a proof was not possible, since the 
number Q appears only in P2 and P3, which concern addition, while the assertion 
in question involves multiplication. With P9 the proof is simple, though perhaps 
not obvious: We have 


a-O+a-O0=a-(04+0) 
@q] +: 


as we have already noted, this immediately implies (by adding —(a - 0) to both 
sides) that a- 0 = 0. 

A series of further consequences of P9 may help explain the somewhat myste- 
rious rule that the product of two negative numbers is positive. ‘To begin with, 
we will establish the more easily acceptable assertion that (—a)-b = —(a-b). To 
prove this, note that 


(-—a)-b+a-b=([(—a)+a]-b 
=D 
= 


It follows immediately (by adding —(a-b) to both sides) that (—a)-b = —(a- b). 
Now note that 


(—a) - (—b) + [-(a- b)] = (—a) - (—b) + (—a) +b 
= (—a) - [(—b) + b] 
= (-a):0 
= 0. 
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Consequently, adding (a - b) to both sides, we obtain 
(—a)-(—b) =a-b. 


The fact that the product of two negative numbers is positive is thus a consequence 
of Pl P9. In other words, ¢7fwe want PI to P9 to be true, the rule for the product of two 
negative numbers is forced upon us. 

The various consequences of P9 examined so far, although interesting and im- 
portant, do not really indicate the significance of P9; after all, we could have listed 
each of these properties separately. Actually, P9 is the justification for almost all 
algebraic manipulations. For example, although we have shown how to solve the 
equation 


(x — D& —2)=0, 


we can hardly expect to be presented with an equation in this form. We are more 
likely to be confronted with the equation 


537452 = 0. 


Lye) 
OO 


The “factorization” x“ — 3x +2 = (x — 1)(« — 2) is really a triple use of P9: 


(mere ais (ge ak 

xx +x-(—2)+(-1)-x+(-1)- (-2) 
x? +x[(-2)+(-l] +2 

= heer 2. 


Co selGas 2) 


A final illustration of the importance of P9 is the fact that this property 1s actually 
used every ime one multiplies arabic numerals. For example, the calculation 


13 
Os 


is a concise arrangement for the following equations: 


13 24a 1 WO) 
sien (een ioc! 
= 4. MORES. 


(Note that moving 26 to the left in the above calculation is the same as writing 
26-10.) ‘The multiplication 13 -4 = 52 uses P9 also: 


13-4=(1-10+43)-4 
=1-10-443.-4 
= 4 eet ale 
= 4 = 1) =F alee 4-2 
= (44+ 1)-10+2 
— ef Uae 
=p 
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The properties PI—P9 have descriptive names which are not essential to remem- 
ber, but which are often convenient for reference. We will take this opportunity to 
list properties P1—P9 together and indicate the names by which they are commonly 


designated. 

(P1) (Associative law for addition) el (beeen Ciel ela 

(P2) (Existence of an additive a+0=0+a=a. 
identity) 

(P3) (Existence of additive inverses) a+ (—a) = (-—a)+a=0. 

(P4) (Commutative law for addition) atb=b+a. 

(P5) (Associative law for multiplica- ad -(bD~c) =6Ge DD) ie 
tion) 

(P6) (Existence of a multiplicative @gi= l-G@=¢: ae 
identity) 

(P7) (Existence of multiplicative a-a'=a"!.a=1, fora ¥0. 
inverses) 

(P8) (Commutative law for multi- (mb = ba 
plication) 

(P9) (Distributive law) a-(b+c)=a-b+a-c. 


The three basic properties of numbers which remain to be listed are concerned 
with mequalities. Although inequalities occur rarely in elementary mathematics, 
they play a prominent role in calculus. ‘The two notions of inequality, a < b 
(a is less than b) and a > b (a 1s greater than b), are intimately related: a < b 
means the same as b > a (thus | < 3 and 3 > | are merely two ways of writing 
the same assertion). The numbers a satisfying a > 0 are called positive, while 
those numbers a satisfying a < 0 are called negative. While positivity can thus 
be defined in terms of <, it is possible to reverse the procedure: a < b can be 
defined to mean that b — a is positive. In fact, it is convenient to consider the 
collection of all positive numbers, denoted by P, as the basic concept, and state 
all properties in terms of P: 


(P10) (Trichotomy law) For every number a, one and only one of the 
following holds: 


(al 
(i) a@is in the collection P, 
(il) —a is in the collection P. 


(P11) (Closure under addition) If a and b are in P, then a +b is in P. 


(P12) (Closure under multiplication) If a and b are in P, then a -b is 
nied? 
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These three properties should be complemented with the following definitions: 


a>b i @—B isin P: 

Ge Ooi eb = a 

aap i (ae) O;G.— 
PS NR Os 


Note, in particular, that a > 0 if and only if a isin P. 

All the famihar facts about Inequalities, however elementary they may seem, are 
consequences of P1O-P12. For example, if a@ and b are any two numbers, then 
precisely one of the following holds: 


i) a—-b=0, 
(i) @—b is m the collection P, 
(il) —(a —b) = b—ais im the collection P. 


Using the definitions just made, it follows that precisely one of the following holds: 


(ie a=, 
Gi)" aS", 
(iy Dee Ge 


A slightly more interesting fact results from the followmg manipulations. If 
a < b, so that b—a is in P, then surely (b + c) — (a +c) is in P; thus, if a < D, 
then a+c<b+c. Similarly, suppose a < b and b < c. Then 


pa istn P. 
and c—bisin P, 
sO OP W(t) EN roe ete ee 


This shows that if a < b and b < c, then a < c. (The two inequalities a < b and 
b < care usually written in the abbreviated form a < b < c, which has the third 
inequality a < c almost built in.) 

The following assertion 1s somewhat less obvious: If a < 0 and b < 0, then 
ab > 0. The only difficulty presented by the proof is the unraveling of definitions. 
The symbol a < 0 means, by definition, 0 > a, which means 0 — a = —a 1s in P. 
Similarly —b is m P, and consequently, by P12, (—a)(—b) = ab ism P. Thus 
ab > 0. 

The factthatrab = O1f aa> Os> =) 0 and also @ = 0, baa) hussone 
special consequence: a? > 0 if a 4 0. Thus squares of nonzero numbers are 
always positive, and m particular we have proved a result which might have seemed 
sufficiently elementary to be included in our list of properties: 1 > 0 (since 1 = 1°). 


* There is one slightly perplexing feature of the symbols > and <. ‘The statements 


[481 <3 
pees) 


are both true, even though we know that < could be replaced by < im the first, and by = im the 
second. ‘Vhis sort of thing is bound to occur when < is used with specific numbers: the usefulness 
of the symbol is revealed by a statement like Vheorem |~ -here equality holds for some values of a 
and b, while inequality holds for other values. 


THEOREM 1 


PROOF 
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The fact that -a > 0 if a < 0 is the basis of a concept which will play an 
extremely important role in this book. For any number a, we define the absolute 
value |a| of a as follows: 


a (aed 0] 
la| = 
—a, a<Q0. 


Note that |a| is always positive, except when a = 0. For example, we have | — 3| = 
3G 7 Wich ale Intense ariel dec} Wien NO ey ol 


In general, the most straightforward approach to any problem involving absolute 


values requires treating several cases separately, since absolute values are defined 
by cases to begin with. This approach may be used to prove the following very 
unportant fact about absolute values. 


For all numbers a and b, we have 


la+ b| < |a| + |d|. 


We will consider 4 cases: 


(1) a2 07 bev; 
(2) a eb 0: 
(3) a=0, b>0; 
(4) Gee Ol) Db =O: 


In case (1) we also have a + b > O, and the theorem is obvious; in fact, 
lat+bl|=a+b=|a|+ |b, 


so that in this case equality holds. 
In case (4) we have a + b < 0, and again equality holds: 


la+b| = —-—(a+b) = —a+ (—b) = |a| + |DI. 
In case (2), when a > 0 and b < 0, we must prove that 
lat+bl<a-—b. 
This case may therefore be divided into two subcases. If a+b > 0, then we must 
prove that 
a+b<a-—b, 
1.€., b<—b, 


which is certainly true since b < 0 and hence —b > 0. On the other hand, if 
a+b <0, we must prove that 


—-a-—b<a-—hb, 


= 
= 


1.€, —a<a, 


which is certainly true since a > 0 and hence -a < 0. 


Finally, note that case (3) may be disposed of with no additional work, by apply- 
ing case (2) with a and b mterchanged. J 
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Although this method of treating absolute values (separate consideration of var- 
lous Cases) is sometimes the only approach available, there are often simpler meth- 
ods which may be used. In fact, it is possible to give a much shorter proof of 
Theorem 1; this proof is motivated by the observation that 


la| = Va2. 


(Here, and throughout the book, /x denotes the positive square root of x; this 
symbol is defined only when x > 0.) We may now observe that 


(la + b|)? =(a+ b)- =a? +2ab+b? 
aw + Qallb- ee 
= al? + 2la|- |b] + [bP 
= (\a| + |b})?. 


al : 2: 2 
From this we can conclude that |a + b| < Ja] + |b| because x2 < y~ mmphies x < y, 


provided that x and y are both nonnegative; a proof of this fact is left to the reader 
(Problem 5). 

One final observation may be made about the theorem we have just proved: a 
close examination of either proof offered shows that 


la + b| = |a| + |D| 


if a and b have the same sign (i.e., are both positive or both negative), or if one of 
the two 1s 0, while 
la+b| < |a| + |b| 


if a and b are of opposite signs. 

We will conclude this chapter with a subtle point, neglected until now, whose 
inclusion is required in a conscientious survey of the properties of numbers. After 
stating property P9, we proved that a — b = b—a implies a = b. ‘The proof began 
by establishing that 

a-(lil=b- Cl + ly, 


from which we concluded that a = b. This result is obtained from the equation 
a-(1+1) = b- (1 +1) by dividing both sides by 1 + 1. Division by 0 should 
be avoided scrupulously, and it must therefore be admitted that the validity of the 
argument depends on knowing that 1+ 1 4 0. Problem 25 is designed to convince 
you that this fact cannot possibly be proved from properties Pl P9 alone! Once 
P10, Pll, and P12 are available, however, the proof is very simple: We have 
already seen that | > 0; it follows that 1 + 1 > 0, and in particular 1+ 1 4 0. 
his last demonstration has perhaps only strengthened your fecling that it 1s 
absurd to bother proving such obvious facts, but an honest assessment of our 
present ‘situation will help justify serious consideration of such details. In this 
chapter we have assumed that numbers are familiar objects, and that P1-P12 are 
merely explicit statements of obvious, well-known properties of numbers. It would 
be difficult, however, to justify this assumption. Although one learns how to “work 
with” numbers m school, just what numbers ave, remaims rather vague. A great 
deal of this book is devoted to elucidating the concept of numbers, and by the end 
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of the book we will have become quite well acquainted with them. But it will be 
necessary to work with numbers throughout the book. It 1s therefore reasonable 
to admit frankly that we do not yet thoroughly understand numbers; we may stl 
say that, i whatever way numbers are finally defined, they should certainly have 
properties*? =r is 

Most of this chapter has been an attempt to present convincing evidence that 
Pl P12 are indeed basic properties which we should assume in order to deduce 
other familiar properties of numbers. Some of the problems (which indicate the 
derivation of other facts about numbers from P1-—P12) are offered as further evi- 
dence. It is still a crucial question whether P1—P12 actually account for a// prop- 
erties of numbers. As a matter of fact, we shall soon see that they do not. In the 
next chapter the deficiencies of properties P]—P12 will become quite clear, but 
the proper means for correcting these deficiencies is not so easily discovered. ‘The 
crucial additional basic property of numbers which we are seeking is profound and 
subtle, quite unlike Pl- P12. The discovery of this crucial property will require all 
the work of Part II of this book. In the remainder of Part I we will begin to see 
why some additional property is required; in order to investigate this we will have 
to consider a little more carefully what we mean by “numbers.” 


PROBLEMS 


1. Prove the followmg: 


@ Itiex = ator some inn ber ai 0 then x — I. 

ii) x*—y*=(x—y)@+y). 

Gn) ier — y= them. —y or = —=y, 

(iv) x3 — yi = (x — y)(x? + xy + y?). 

oo) Fee Wie a yee ay ae 

(vi) xi +y? = (x+y)? — xy + y?). (There is a particularly easy way to 


do this, usmg (iv), and it will show you how to find a factorization for 
x" + y" whenever 7 is odd.) 


2. What is wrong with the following “proof”? Let x = y. Then 


ay 
eye 
9) 9) 9] 
x°—y"°=xy— y*, 
Clary) (X= eV OG ay), 
X+y=y, 
2y=y, 
Cs — me 


3. Prove the following: 


a € 


GQ = ic 0. 
be 


4 a C ad+hbec .. 
— + — = —— if b,.d £0. 
(n) b d bd be? 7 
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(iii) (ab)~-' = a~'b"!, if a,b 4 0. (To do this you must remember the 
defining property of (ab)~!.) 


a@ ¢€ “ae 
iv) —-—=—,if b, 0. 
MBG apn ree 

a ads... 
(v) tf i= Sit bed £0 
(vi) If b,d #0, then ; = = if and only if ad = be. Also determine when 

Gs b 

b a 
Find all numbers x for which 
(i) 4—x <3—-—2x. 
i) S— x28, 
(in) St ye = 
(iv) (x — 1)(x — 3) > 0. (When is a product of two numbers positive?) 
(Vi) x2 = 24-2 SO! 
(wi) x2 ex + 1S 2: 
(vii) x*-x+10> 16. 
(viii) x7 +x+1>0. 
(ix) (x—2)(x+5)(x — 3) > 0. 
(x) (x — ¥2)(x — V2) >0. 
(xq) 28: 
(xii) x+3* <4 

] | 
(xin) — + > 0. 

l—x 
—] 

(xiv) etal > 0. 


Prove the following: 


(i) Ifa<bandc <d,thena+c <b+d. 

(ji) If a<b, then —b < —a. 

Gi) Ifa<bandc>d,thena—c <b—d. 

(iv) If a <bandc > 0, then ace < be. 

(v) Ifa<bandc <0, then ac > be. 

(vi) Ifa> 1, thena? >a. 

(vii) If O<a <1, then a? <a. 

(vii) If O< a < band 0 <c <d, then ac < bd. 

(ix) If 0 <a <b, then a? < b?. (Use (viii).) 

(x) If a,b >Oand a? < b?, then a <b. (Use (ix), backwards.) 


(a) -Prove:thatat (0 =< xs<97, thence” <7" Ss ayo y0 -4- 
(b) Prove that if x < y and n is odd, then x” < y". 
(c) Prove that if x” = y" and n is odd, then x = y. 


yn 
(d) Prove that if x” = y” and n is even, then x = y or x = —y. 


EG. 


10. 


Ly 
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Prove that if 0 <a < b, then 
a+b 


a< Oe ee 


Notice that the mequality Vab < (a + b)/2 holds for all a, b > 0. A gener- 
alization of this fact occurs in Problem 2-22. 


Although the basic properties of inequalities were stated in terms of the col- 
lection P of all positive numbers, and < was defined in terms of P, this 
procedure can be reversed. Suppose that P10 P12 are replaced by 


(P’10) For any numbers a and b one, and only one, of the 


following holds: 
(i) “a=; 
(a) a= Dy 
(iui)) Dea 


(P’11) For any numbers a, b, and c, if a < band b < c, then 
GG. | 

(P’12) For any numbers a, b, and c, if a < b, then 
a+c<bte. 

(P’13) For any numbers a, b, and c, if a < b and 0 <c, then 
dG< be 


Show that P]O-P12 can then be deduced as theorems. 


Express each of the following with at least one less pair of absolute value 
signs. 


GQ) w IV 2 try ee NI. 

Gi) (la + | = |a| = |5))I. 

(ii) [(ja + b| + le] = [a+b + cpl. 
(ivy (ae e2 or tele 

( 


vy) 1dv2+ V3) - [45 — V7). 


Express each of the following without absolute value signs, treating various 
cases separately when necessary. 


(i) |a+b|—|dI. 
(a) le) = De 
( 
( 


eee 


in) |x|— ea 


lv) a-—|(a—|a})|. 

Find all numbers x for which 
(i) [x —3| <8. 

(i) |x +4[ <2. 

(wv) |x —1]+ |x -—2] > 1. 
(vv) lx —1|+ |x +1| < 2. 
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£2: 


iS: 


lf 


ale 


eal lity, 


(vi) — hype law + 1] <ade 
(vine tb leyle se b] = 0. 
(vin) |x — 1[- |x + 2] = 3. 


Prove the following: 


@)  lxyl = la] -lyl. 

" | I 3 

(11) Ali Rl if x # 0. (The best way to do this is to remember what 
ball asks) 

Gi) pc laa ia <0. 
1 es 

(iv) |x — y| S |x| + |y|. (Give a very short proof.) 

(v) |x|-—|y| < lx —y|. (A very short proof is possible, if you write things in 
the right way.) 

(vi) [C(x] — |y)| < |x — »|. (Why does this follow mmediately from (v)?) 


(vn) [x +y +2] < |x| + ly| +1zl. Indicate when equality holds, and prove 
your statement. 


The maximum of two numbers x and y is denoted by max(x, y). Thus 
Max(—1, 0) = max(3,5) = 3 and max(—1|,—4) = max(—4-,—1) = —I. 
The minimum of x and y is denoted by min(x, y). Prove that 
part eal 

A ; 


Yo aly — »| 
= : 


~ 


lax.) 


min(x, y) = 
Derive a formula for max(x, vy, z) and min(x, y, Z), using, for example 
eie (12) —inlax er, ax (iy. 2): 


(a) Prove that |a| = |—a|. (The trick is not to become confused by too many 
cases. First prove the statement for a > 0. Why is it then obvious for 
a= 0 

(b) Prove that —b <a < b if and only if |a| < 6. In particular, it follows 
that —|a| <a < al. 

(c) Use this fact to give a new proof that |a + b| < |a| + ||. 


Prove that if x and y are not both 0, then 
ae xv + ye (0) 
ree Eee ee Ey? = 0) 
Hint: Use Problem I. 
(a) Show that 


: 2 D 2) 
Ce y) tres + F° wonly Whenw = 0 or » = 0, 


3 es 
(s+ y)F =x? + ye Yonkewhen's = 0O.of » =0.cmR = Sy. 


II 


IW 


18. 


19. 
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Using the fact that 
ae i Oe 


show that 4x? + 6xy + 4y? > 0 unless x and y are both 0. 

Use part (b) to find out when (x + y)* = x74 yt. 

Find out when (x ey) =, aye. Hint: From the assumption (e-ky)° = 
pow you should be able to derive the equation x ye ey a 
0, if xy £0. This imphes that (x + yp =a2yt xy? =xyv(xt+y). 


You should now be able to make a good guess as to when (x + y)" = x"-+ y"; 
the proof 1s contained in Problem 11-63. 


(a) 


Find the smallest possible value of 2x7 — 3x +4. Hint: “Complete the 
square,” ie., write 2x? — 3x + 4 = 2(x — 3/4)? +? 

Find the smallest possible value of x7 — 3x + 2y? + 4y + 2. 

Find the smallest possible value of poe ay yee oy 4 — Ge 


Suppose that b? — 4c > 0. Show that the numbers 


—b fal b? S46 ee 
——)> ee es 


2 


both satisfy the equation x7 + bx +c = 0. 

Suppose that b? — 4c < 0. Show that there are no numbers x satisfying 
x* + bx +c = 0; in fact, x7 + bx +c > 0 for all x. Hint: Complete the 
square. 

Use this fact to give another proof that if x and y are not both 0, then 
ORS pei S810), 

For which numbers a is it true that x? + axy + y? > 0 whenever x and 
y are not both 0? 

Find the smallest possible value of x? + bx + and of ax? + bx +c, for 
a> U: 


yy ~ . . 
The fact that a~ > O for all numbers a, elementary as it may seem, 1s 


nevertheless the fundamental idea upon which most important inequali- 


ties are ultimately based. ‘The great-granddaddy of all inequalities is the 
Schwarz mequality: 


XSi MD? = VM ae XO” VY Vi yo 


(A more general form occurs in Problem 2-21.) The three proofs of the 

Schwarz inequality outlined below have only one thing in common —their 
: 2 D) 

reliance on the fact that a~ > 0 for all a. 


(a) 


Prove that if x; = Ay; and x2 = Ay2 for some number A > 0, then 
equality holds in the Schwarz inequality. Prove the same thing if y; = 
y2 = 0. Now suppose that y; and y2 are not both 0, and that there is no 
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number A such that x; = Ay; and x2 = Ay2. Then 


Ory — tp Ge 
= A7(y,7 + yo E— 2AM EEE yo) + Cele we) 


Using Problem 18, complete the proof of the Schwarz inequality. 
Prove the Schwarz inequality by using 2xy < x*+y? (how is this derived?) 
with 

Xj Vi 


oe, i —<——_—__—_—-t 
mtd) e), wy 4) 
ES Cea ea v Viegas: 


first for 7 = | and then for 7 = 2. 


i= 


Prove the Schwarz inequality by first proving that 
2 2 ? 9 2 
(x17 + x27)? + y27) = iy + x292)* + Oriy2 — 1291)”. 


Deduce, from each of these three proofs, that equality holds only when 
yy = y2 = 0 or when there is a number A > O such that +) = Ay; and 
XO = 1). 


In our later work, three facts about inequalities will be crucial. Although proofs 


will be supplied at the appropriate pomt in the text, a personal assault on these 
problems 1s infinitely more enlightening than a perusal of a completely worked-out 
proof. ‘The statements of these propositions involve some weird numbers, but their 


basic message is very simple: if x 1s close enough to x9, and y ts close enough to yo, 
then x+y will be close to x9 + yo, and xy will be close to xgy9, and 1/y will be close 
to 1/y9. ‘The symbol “e” which appears in these propositions is the fifth letter of the 
Greek alphabet (“epsilon”), and could just as well be replaced by a less intimidating 


Roman letter; however, tradition has made the use of €¢ almost sacrosanct in the 
contexts to which these theorems apply. 


20. 


ea) 


Prove that if 


ie) 


E 
|v — xg| < 5 and |y — yol < 


- 


9” 


then 


I(x + y) — Go + Yo)| <€. 
I(x — y) — (Xo — Yo)| < &. 


Prove that if 


E€ 


; E 
2(\yol +1)” 


and |y — yl < ————— 
y= Jo Zola) 


|x — xo| < min ( 


then |xy — x9yo] < €. 


(The notation “min” was defined in Problem 13, but the formula provided by 


that problem is irrelevant at the moment; the first equality in the hypothesis 


just means that 


~ 


ie, ee Sy ae 
2(l¥o| + 1) | | 


Le = aol" = 


2. 


423. 


ge 
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at one point in the argument you will need the first inequality, and at an- 
other point you will need the second. One more word of advice: since the 
hypotheses only provide information about x — x9 and y — yg, it is almost a 
foregone conclusion that the proof will depend upon writing xy — xoyo in a 
way that involves x — x9 and y — yo.) 


Prove that if yo 4 0 and 


ly — yo] < min Lvol elyol’ 
- YO y) 9 2) ’ 
then y 4 0 and 

1 ] 

———|<e 

y 0 


Replace the question marks in the following statement by expressions involv- 
ing €, x9, and yo so that the conclusion will be true: 


If yo 4 O and 
ly _— vo| <a 2 and |x Few xo = : 


then y 4 0 and 


hem yp 


This problem is trivial in the sense that its solution follows from Problems 21} 
and 22 with almost no work at all (notice that x/y = x -1/y). The crucial 
point is not to become confused; decide which of the two problems should 
be used first, and don’t panic if your answer looks unlikely. 


This problem shows that the actual placement of parentheses in a sum is 
irrelevant. The proofs involve “mathematical induction”; if you are not fa- 
miliar with such proofs, but still want to tackle this problem, it can be saved 
until after Chapter 2, where proofs by induction are explained. 

Let us agree, for definiteness, that ay +--+ +a, will denote 


ay sp (a2 ot (a3 sta ate (dn-2 he (Ay—1 dr an))) a ee i) 


Thus a; + a2 + a3 denotes ay + (a2 + a3), and ay + az + a3 + a4 denotes 
GY As tea G3 4), CIC, 


(a) Prove that 
(aq a a) oe 


Hint: Use induction on k. 
(b) Prove that if n > k, then 


(Gila te et eee Oy) =F [ictal Gy. 


Hint: Use part (a) to give a proof by induction om k. 
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(c) Let s(ay,....a,) be some sum formed from aj,..., ax. Show that 
S(Q], 2-65 Ay) = a] +--+ - + ay. 


Hint: There must be two sums s‘(a),...,a;) and s”(aj;44,..., ax) such 
that 


S(GYs,.-0 AR) HS (Aig one BE) AS GP: 5 QR): 


25. Suppose that we interpret “number” to mean either 0 or 1, and + and - to 
be the operations defined by the following two tables. 


Ae 0 l 0 l 
SRAGIe. 
l | 0 l 


Check that properties P1—P9 all hold, even though 1 + 1 = 0. 


CHAPTER 


NUMBERS OF VARIOUS SORTS 


In Chapter | we used the word “number” very loosely, despite our concern with 
the basic properties of numbers. It will now be necessary to distinguish carefully 
various kinds of numbers. 


The simplest numbers are the “counting numbers” 


The fundamental significance of this collection of numbers is emphasized by its 
symbol N (for natural numbers). A brief glance at PI P12 will show that our 
basic properties of “numbers” do not apply to N—for example, P2 and P3 do not 
make sense for N. From this point of view the system N has many deficiencies. 
Nevertheless, N is sufficiently important to deserve several comments before we 
consider larger collections of numbers. 

The most basic property of N is the principle of “mathematical induction.” 
Suppose P(x) means that the property P holds for the number x. Then the prin- 
ciple of mathematical induction states that P(x) is true for all natural numbers x 
provided that 


(Ly 2d |i) as tieues 
(2) Whenever P(k) is true, P(k + 1) is true. 


Note that condition (2) merely asserts the truth of P(k+1) under the assumption 
that P(k) is true; this suffices to ensure the truth of P(x) for all x, if condition 
(1) also holds. In fact, if P(1) is true, then it follows that P(2) is true (by using 
(2) in the special case k = 1). Now, since P(2) is true it follows that P(3) is true 
(using (2) in the special case k = 2). It is clear that each number will eventually be 
reached by a series of steps of this sort, so that P(x) is true for all numbers k. 

A favorite illustration of the reasoning behind mathematical induction envisions 
an infinite line of people, 


person number 1, person number 2, person number 3, ... . 


If each person has been instructed to tell any secret he hears to the person behind 
him (the one with the next largest number) and a secret is told to person number I, 
then clearly every person will eventually learn the secret. If P(x) is the assertion 
that person number x will learn the secret, then the mstructions given (to tell all 
secrets learned to the next person) assures that condition (2) is true, and telling 
the secret to person number | makes (1) true. The following example is a less 
facetious use of mathematical induction. There ts a useful and striking formula 
which expresses the sum of the first 2 numbers in a simple way: 


21 
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n(n + 1) 
=k 


= 


| eoweae = 


‘To prove this formula, note first that it 1s clearly true for n = 1. Now assume that 
for some natural number k we have 


_ kk ED 


aera genet 
feet ; 


Then 


nee er ere me OSU 
me pa alee 
7 p) 
ee fe ey ne 
2 
_ k+ IK + 2) 
2 


5 


so the formula is also true for k + 1. By the principle of induction this proves 
the formula for all natural numbers n. ‘This particular example illustrates a phe- 
nomenon that frequently occurs, especially in connection with formulas like the 
one just proved. Although the proof by induction is often quite straightforward, 
the method by which the formula was discovered remains a mystery. Problems 5 
and 6 indicate how some formulas of this type may be derived. 


The principle of mathematical induction may be formulated in an equivalent 
way without speaking of “properties” of a number, a term which is sufficiently 
vague to be eschewed in a mathematical discussion. A more precise formulation 
states that if A is any collection (or “set”-—a synonymous mathematical term) of 
natural numbers and 


() distin, 


(2). k+11sin A whenever k is in A, 


then A is the set of all natural numbers. It should be clear that this formulation 
adequately replaces the less formal one given previously—we Just consider the 
set A of natural numbers x which satisfy P(x). For example, suppose A is the set 
of natural numbers n for which it is true that 


n(n +1) 
nae 


= 


ee 


Our previous proof of this formula showed that A contains 1, and that k + I 1s 
in A, if kis. It follows that A is the set of all natural numbers, i.c., that the formula 
holds for all natural numbers 7. 

There is yet another rigorous formulation of the principle of mathematical m- 
duction, which looks quite different. If A is any collection of natural numbers, it 
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is tempting to say that A must have a smallest member. Actually, this statement 
can fail to be true in a rather subtle way. A particularly important set of natural 
numbers is the collection A that contains no natural numbers at all, the “empty 
collection” or “null set,’* denoted by #. The null set 9 is a collection of natural 
numbers that has no smallest member—in fact, it has no members at all. ‘This 
is the only possible exception, however; if A is a nonnull set of natural numbers, 
then A has a least member. This “intuitively obvious” statement, known as the 
“well-ordering principle,” can be proved from the principle of induction as follows. 
Suppose that the set A has no least member. Let B be the set of natural numbers 
n such that 1,..., are all not in A. Clearly 1 is in B (because if 1 were in A, then 
A would have | as smallest member). Moreover, if 1,....& are not in A, surely 
k + 1 is not in A (otherwise k + | would be the smallest member of A), so 1,... , 
k +1 are all not in A. This shows that if k is in B, then k + 1 is in B. It follows 
that every number v 1s in B, 1.e., the numbers 1,... , 2 are not in A for any natural 
number n. Thus A = 4, which completes the proof. 

It is also possible to prove the principle of induction from the well-ordering 
principle (Problem 10). Either principle may be considered as a basic assumption 
about the natural numbers. 

There is still another form of induction which should be mentioned. It some- 
times happens that in order to prove P(k + 1) we must assume not only P(k), but 
also P(/) for all natural numbers / < k. In this case we rely on the “principle of 
complete mduction”: If A is a set of natural numbers and 


(1) 1 isan_A, 
(2) Ka ihis meA it ion... « are in A, 


then A is the set of all natural numbers. 

Although the principle of complete induction may appear much stronger than 
the ordinary principle of induction, it is actually a consequence of that principle. 
The proof of this fact is left to the reader, with a hint (Problem 11). Applications 
will be found in Problems 7, 17, 20 and 22. 

Closely related to proofs by induction are “recursive definitions.” For example, 
the number n! (read “n factorial”) is defined as the product of all the natural 
numbers less than or equal to n: 


rere. Gt = [<n 
This can be expressed more precisely as follows: 


q) w= 


l 
(2) ni=n-(n—JD)!. 


This form of the definition exhibits the relationship between n! and (n — 1)! in an 


* Although it may not strike you as a collection, in the ordinary sense of the word, the null set arises 
quite naturally in many contexts. We frequently consider the set A, consisting of all x satisfying some 
property P; often we have no guarantee that P is satisfied by any number, so that A might be A in 
fact often one proves that P is always false by showing that A = ¥. 
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explicit way that 1s ideally suited for proofs by duction. Problem 23 reviews a 
definition already famihar to you, which may be expressed more succinctly as a re- 
cursive definition; as this problem shows, the recursive definition is really necessary 
for a rigorous proof of some of the basic properties of the definition. 

One definition which may not be familiar involves some convenient notation 
which we will constantly be using. Instead of writing 


ON aR oa ap Oy. 


we will usually employ the Greek letter © (capital sigma, for “sum’”’) and write 


i 
> ai. 
i=l 
W 
In other words, > ai denotes the sum of the numbers obtained by letting 
i=l 


i= 1,2.....m. Thus 


H 


| 
Sie Sabine: ees 


fl 


Hn 
Notice that the letter 7 really has nothing to do with the number denoted by - i, 
i— 


and can be replaced by any convenient symbol (except n, of course!): 


n 


tt OG 1) 
pny wer ee 
a 
e Pena) 

/ = 2 ’ 
j=l 
i or 

Sue) 
= 
n=!| 


n 


‘To define ) a; precisely really requires a recursive definition: 


i=l 
| 
(yy a= ait 
i=l 
n n—| 
(2) > 4; = > ai + ay. 
i=l i= 


But only purveyors of mathematical austerity would imsist too strongly on sich 
y | ) \ gi 

precision. In practice, all sorts of modifications of this symbolism are used, and 

no one ever considers it necessary to add any words of explanation. ‘Phe symbol 
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for example, 1s an obvious way of writing 
Gs i OS ato °° I On, 


or more precisely, 


Yai + ie 


ill (=5) 


The deficiencies of the natural numbers which we discovered at the beginning 
of this chapter may be partially remedied by extending this system to the set of 
integers 

=2,—F0, 1,2,..5. 


‘This set is denoted by Z (from German “Zahl,” number). Of properties PI—P12, 
only P7 fails for Z. 

A stll larger system of numbers is obtained by taking quotients m/n of integers 
(with n 4 0). ‘These numbers are called rational numbers, and the set of all 
rational numbers is denoted by Q (for “quotients”). In this system of numbers all 
of Pl P12 are true. It is tempting to conclude that the “properties of numbers,” 
which we studied in some detail in Chapter 1, refer to just one set of numbers, 
namely, Q. There is, however, a still larger collection of numbers to which proper- 
ties PI—P12 apply—the set of all real numbers, denoted by R. The real numbers 
include not only the rational numbers, but other numbers as well (the irrational 
numbers) which can be represented by infinite decimals; 1 and V2 are both 
examples of irrational numbers. The proof that z is irrational is not easy - we 
shall devote all of Chapter 16 of Part HI to a proof of this fact. The irrationality 
of V2. on the other hand, is quite simple, and was known to the Greeks. (Since the 
Pythagorean theorem shows that an isosceles right triangle, with sides of length 1, 
has a hypotenuse of length /2, it is not surprising that the Greeks should have 
investigated this question.) ‘The proof depends on a few observations about the 
natural numbers. Every natural number n can be written either in the form 2k 
for some integer k, or else in the form 2k + | for some integer & (this “obvious” 
fact has a simple proof by induction (Problem 8)). Those natural numbers of the 
form 2k are called even; those of the form 2k + | are called odd. Note that even 
numbers have even squares, and odd numbers have odd squares: 


(2k)? = 4k? = 2. (2k*), 
(2k + 1)? = 4k? + 4k 4+ 1 = 2- (2k? 42k) +1. 
In particular it follows that the converse must also hold: if n? is even, then 7 is even: 


“cf ane . + . . . . . . 
if n“ is odd, then n is odd. ‘The proof that V2 is irrational is now quite simple. 


Suppose that J/2 were rational; that is, suppose there were natural numbers p 
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and q such that 


Py =2 
q 


We can assume that p and qg have no common divisor (since all common divisors 
could be divided out to begin with). Now we have 
3) 3 
po =2q° 
44° Oh . + 
This shows that p~ is even, and consequently p must be even; that is, p = 2k for 
some natural number k. Then 


2k? =q? 

This shows that g* is even, and consequently that g is even. Thus both p and g 
are even, contradicting the fact that p and g have no common divisor. This 
contradiction completes the proof. 

It is important to understand precisely what this proof shows. We have demon- 
strated that there is no rational number x such that x7 = 2. This assertion is often 
expressed more briefly by saying that V2 is irrational. Note, however, that the 
use of the symbol J/2 imphies the existence of some number (necessarily irrational) 
whose square is 2. We have not proved that such a number exists and we can as- 
sert confidently that, at present, a proof is impossible for us. Any proof at this stage 
would have to be based on P1~P12 (the only properties of R we have mentioned): 
since Pl P12 are also true for Q the exact same argument would show that there 
is a rational number whose square is 2, and this we know is false. (Note that the 
reverse argument will not work ——our proof that there 1s no rational number whose 
square is 2 cannot be used to show that there 1s no real number whose square is 2, 
because our proof used not only Pl- P12 but also a special property of Q, the fact 
that every number in Q can be written p/q for mtegers p and q.) 

This particular deficiency in our list of properties of the real numbers could, 
of course, be corrected by adding a new property which asserts the existence of 
square roots of positive numbers. Resorting to such a measure is, however, neither 
aesthetically pleasing nor mathematically satisfactory; we would sull not know that 
every number has an ath root if # is odd, and that every positive number has an 
nth root if 7 is even. Even if we assumed this, we could not prove the existence of 
a number x satisfying x? +x + 1 = 0 (even though there does happen to be one), 
since we do not know how to write the solution of the equation in terms of nth 
roots (in fact, 1t is known that the solution cannot be written m this form). And, 
of course, we certaily do not wish to assume that all equations have solutions, 
since this is false (10 real number x satisfies x? + 1 = 0, for example). In fact, 
this direction of investigation is not a fruitful one. ‘Phe most useful hints about the 
property distinguishmg R from Q, the most compelling evidence for the necessity 
of elucidating this property, do not come from the study of numbers alone. In 
order to study the properties of the real numbers 1 a more profound way, we 
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must study more than the real numbers. At this point we must begin with the 
foundations of calculus, in particular the fundamental concept on which calculus 
is based—functions. 


PROBLEMS 
1. Prove the following formulas by induction. 


(i) ae eee n(n + 1)(2n + 1) 
\"/ — 6 . 


Gi) 134---+n3=(1+4+---+4n)?. 
2. Find a formula for 
(@) 0-1) =143454-+++ Qn—1). 
i=1 


ji) (i —1)? = 1° +3? +5? +--+ + (Qn - 1), 

=I 
Hint: What do these expressions have to do with 1 + 2+3-+---+2n and 
1? 4 22 4 324---+4(2n)?? 


‘ ; ; Ss ps Pees : 
3. IfO<k <n, the “binomial coefficient (;) is defined by 


‘ae n! 2 eS een Sth). ore 
k k'(n — k)! k! 


1 


" n : ; ; : . | 
( ) = ( ) = | (a special case of the first formula if we define 0! = 1), 


and for k < 0 or k > n we just define the binomial coefficient to be 0. 


Cra @ 


(The proof does not require an induction argument.) 


(a) Prove that 


‘This relation gives rise to the following configuration, known as “Pas- 
cal’s triangle’—a number not on one of the sides is the sum of the two 


: ‘ , 2%. n\, 
numbers above it; the binomial coefficient , is the (k + 1)st number 


in the (2 + 1)st row. 
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(b) Notice that all the numbers in Pascal’s triangle are natural numbers. Use 
: . ny. , 
part (a) to prove by induction that a he always a natural number. (Your 


entire proof by induction will, in a sense, be summed up in a glance by 
Pascal’s triangle.) 


Neer < Tes : 
(c) Give another proof that (;) is a natural number by showing that 


iy 


TEN . ; : 
is the number of sets of exactly k integers each chosen from 1, 
ey he 


(d) Prove the “binomial theorem”: If a and b are any numbers and nis a 
natural number, then 


(a+b)" =a"+ (arto + (5 anv? eS ( y jab + h" 
z i 


(e) Prove that 


@ 5()-() 


l odd 


4. (a) Prove that 


o(")(, my) z : fig! 


Hint: Apply the binomial theorem to (1 + «)"(1 +x)”. 


(b) Prove that 
= 
5 (Fi (;") 
RJ. NON 


sagt! 
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(a) Prove by induction on n that 


s ie prt! 
L+rtre peer? = 
l =r 
if r #1 Gif r = 1, evaluating the sum certainly presents no problem). 
(b) Derive this result by setting § = 1+r+---+r", multiplying this equation 
by r, and solving the two equations for S. 
The formula for 17 +--+ +n? may be derived as follows. We begin with the 
formula 
) 
(k+1)? —b = 3k? 43k +1. 


Writing this formula for k = 1, ... , mn and adding, we obtain 


(n+ 1)? —n? =3 <n? +32 +1 
(n+1)3—1 =3[1?+---+n7]43[14+---+a] +n. 


n n 
Tr . Fy < 
Thus we can find ) k~ if we already know ) k (which could have been 


k=1 k=1 
found in a similar way). Use this method to find 


i) 2 4---4n3. 

(ji) 14+4+---4n% 

et ee ee es ee 
(faz 2 n(n + 1) 

~~ 3 5 2n + 1 

Av) 72-22) 2.320 my TT eh 


n 
Use the method of Problem 6 to show that ) k? can always be written in 
i=l 


the form 


+ An? + Bn senP- A wo 


(The first 10 such expressions are 
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11. 


12. 


k=l 
n 
. 
ye = 1n? + in + =2n 
k= 
n 
is 
Scud = qn +5n 4+ qn 
k=] 
Hl 
qe les 14 News 
yok =sn +5 + 3n” —aqn 
k= 
n 
a 
Daike = in + Ln ao sn — pn 
eal 


z= 
n 
eee ma Le + bn® — Ant + bn? 
k=1 
n 
§ Il lees 2°] "Pe aoe 3) ae 
am = 9! 450 = gil st + gn 397! 
k= 
n 
Omee 110 ,mieo , Seto, 124 To} 
pu: =joh +an + 4M ig + 57 mg 
k=] 
n 
Sok = fall t gal? + 2n? Eg Wi eee 
11 D 6 2 66 
[= 


Notice that the coefficients in the second column are always 5. and that after 
the third column the powers of 2 with nonzcro coefficients decrease by 2 until 
n> or n is reached. ‘The coeflicients in all but the first two columns seem to 
be rather haphazard, but there actually is some sort of pattern; finding it may 
be regarded as a super-perspicacity test. See Problem 27-17 for the complete 


story.) 
Prove that every natural number is either even or odd. 


Prove that if a set A of natural numbers contains ng and contains k + | 
whenever it contains k, then A contains all natural numbers > no. 


Prove the principle of matheniatical induction from the well-ordering prin- 

ciple. 

Prove the principle of complete induction from the ordinary principle of 

induction. Hint: IH A contains | and A contains 2+ 1 whenever it contains 

l,...,7, consider tle.sete? of all k such that ],.... k& are all n A. 

(a) If a@ is rational and 6 is irrational, is a + b necessarily irrational? What 
if a and b are both irrational? 


13. 


14. 


15. 


16. 


al 7. 
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(b) If a is rational and b is irrational, 1s ab necessarily irrational? (Careful!) 

(c) Is there a number a such that a? is irrational, but a? is rational? 

(d) Are there two irrational numbers whose sum and product are both ra- 
tional? 


(a) Prove that V3, J5, and V6 are irrational. Hint: To treat V3, for exam- 
ple, use the fact that every integer is of the form 3” or 3” + 1 or 3a 4-2. 
Why doesn’t this proof work for V4? 

(b) Prove that V2 and V3 are irrational. 


Prove that 
(a) J/2 + V6 is irrational. 
(b) V2 + V3 is irrational. 


(a) Prove that if x = p+ ./q where p and q are rational, and m is a natural 
number, then x” = a+ b,/q for some rational a and b. 


(b) Prove also that (p — /g)” =a—b/q. 


: Dae) 
(a) Prove that if m and nv are natural numbers and m“/n°“ < 2, then 
9} Ds 
(m + 2n)-/Gn +n)° > 2; show, moreover, that 


Baa, 2 
AP 2 m 
Gis ee = 
(m +1n)- ne 
(b) Prove the same results with all inequality signs reversed. 


(c) Prove that if m/n < J/2, then there is another rational number m’/n’ 
with m/n <m'/n' < V2. 


It seems hkely that J/n is irrational whenever the natural number 7 is not 
the square of another natural number. Although the method of Problem 13 
may actually be used to treat any particular case, it is not clear in advance 
that it will always work, and a proof for the general case requires some extra 
information. A natural number p is called a prime number if it is impos- 
sible to write p = ab for natural numbers a and b unless one of these is p, 
and the other 1; for convenience we also agree that | is nof a prime number. 
‘Rhe Srs@tewaprmesmamabers are 2, 3,5; 7, 11, 13,97, 19. lf n> lisnota 
prime, then 2 = ab, with a and b both < n; if either a or b is not a prime it 
can be factored similarly; continuing in this way proves that we can write 1 
ds aiproduct of primes. Forexample, 28 = 4-7 =2-2-7. 


(a) ‘Turn this arguinent into a rigorous proof by complete induction. (To 
be sure, any reasonable mathematician would accept the informal argu- 
ment, but this is partly because it would be obvious to her how to state 
it rigorously.) 


A fundamental theorem about integers, which we will not prove here, states 
that this factorization is unique, except for the order of the factors. hus, 
for example, 28 can never be written as a product of primes one of which 
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*18. 


19. 


20. 


is 3, nor can it be written na way that mvolves 2 only once (now you should 
appreciate why | is not allowed as a prime). 


(b) 


(c) 
(d) 


Using this fact, prove that J/1 is irrational unless n = m? for some natural 
number m. 

Prove more generally that </n is irrational unless n = m*. 

No discussion of prime numbers should fail to allude to Euchid’s beautiful 
proof that there are infinitely many of them. Prove that there cannot be 
only finitely many prime numbers pj, p2, P3, ---, Px by considering 


P| . P2 Omri a Pn + ig 
Prove that if x satisfies 
eRe) ey =O) 


for some integers d,_1,... , 4g, then x Is Irrational unless x is an integer. 
(Why is this a generalization of Problem 17?) 


Prove that W6 — /2 — V3 is irrational. 


Prove that /2 + ¥2 is irrational. Hint: Start by working out the first 6 
powers of this number. 


Prove Bernoulli’s inequality: If h > —1, then 


(l+h)" >1}+nh 


for any natural number n. Why 1s this trivial if 4 > 0? 


The Fibonacci sequence ay, a2, a3,... 1s defined as follows: 


eal 
fee —all. 
Ay = @,=\\ + R22 for: = 3). 


‘This sequence, which begins |, 1, 2, 3,5, 8, ... , was discovered by Fibonacct 
(circa 1175— 1250), in connection with a problem about rabbits. Fibonacci 
assumed that an initial pair of rabbits gave birth to one new pair of rabbits 


per month, and that after two months each new pair behaved similarly. ‘The 


number a, of pairs born in the nth month is ay,—1 + d,—2, because a pair of 


rabbits is born for each pair born the previous month, and moreover each 


pair born two months ago now gives birth to another pair. The number of 


interesting results about this sequence ts truly amazing—there is even a }1- 
bonacci Association which publishes a journal, The fibonacct Quarterly. Prove 


that 


1+V5\" (1-Vv5 ' 


2 2 


V5 


an = 


One way of deriving this astonishing formula is presented in Problem 24-16. 


21. 


pas 


De, 
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The Schwarz inequality (Problem 1-19) actually has a more general form: 


Give three proofs of this, analogous to the three proofs in Problem 1-19. 


The result in Problem 1-7 has an important generalization: If aj....,an > 0, 
then the “arithmetic mean” 


H 


and “geometric mean” 


satisfy 


(a) Suppose that ay < A,. Then some a; satisfies aj > An; for convenience, 
say a2 > A,. Let ay = A, and let ay = a, + a2 — a). Show that 


ajar > ajaz. 


Why does repeating this process enough times eventually prove that G, < 
An? (This is another place where it is a good exercise to provide a formal 
proof by induction, as well as an informal reason.) When does equality 
hold in the formula G,, < A,? 


The reasoning in this proof'is related to another interesting proof. 


< A, when n = 2, prove, by induction on k, that 


(b) Using the fact that G, 
G, < An forn =2*. 
(c) For a general n, let 2” >. Apply part (b) to the 2” numbers 


Gyro Hew ayia capes p 
2” —n times 


to prove that Gy, < An. 


The following is a recursive definition of a”: 


] 


ca— cd, 
| 
Ge = aed 
Prove, by induction, that 
gel — a qa 
(a) = Gee 


(Don't try to be fancy: use either induction on n or induction on m, not both 
at once.) 
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hPIGURE 


] 


i) 


aa, 


Zo: 


26. 


Suppose we know properties Pl and P4 for the natural numbers, but that 
multiplication has never been mentioned. Then the following can be used 
as a recursive definition of multiplication: 


eye, Rem 
(a+1)-b=a-b+hb. 


Prove the following (in the order suggested)!): 


a:-(b+c)=a-b+a-c (use induction on a), 
g- | — ae 
a+b =b-a (you just finished proving the case b = 1). 


In this chapter we began with the natural numbers and gradually built up to 
the real numbers. A completely rigorous discussion of this process requires 
a little book in itself (see Part V). No one has ever figured out how to get to 
the real numbers without going through this process, but if we do accept the 
real numbers as given, then the natural numbers can be defined as the real 
numbers of the form 1, 1+ 1, 1+1+1, ete. The whole point of this problem 
is to show that there is a rigorous mathematical way of saying “etc.” 


(a) A set A of real numbers is called inductive if 


(Ql) alims mm A, 
(2) &-+1 isin A whenever k is in A. 


Prove that 


(i) Ris inductive. 
(i) ‘The set of positive real numbers is inductive. 
(in) ‘The set of positive real numbers unequal to 5 is inductive. 
(iv) ‘The set of positive real numbers unequal to 5 is not inductive. 
(v) If A and B are inductive, then the set C of real numbers which 
are in both A and B 1s also inductive. 
(b) Areal number 7 will be called a natural number if 7 1s in every inductive 
Serr 


(i) Prove that | is a natural number. 
(ii) Prove that k + | is a natural number if & is a natural number. 


There is a puzzle consisting of three spindles, with # concentric rings of 
decreasing diameter stacked on the first (Figure 1). A ring at the top of a 
stack may be moved from one spindle to another spindle, provided that it 
is not placed on top of a smaller ring, For example, if the smallest ring 1s 
moved to spindle 2 and the next-smallest ring is moved to spindle 3, then 
the smallest ring may be moved to spindle 3 also, on top of the next-smallest. 
Prove that the entire stack of m rmgs cau be moved onto spindle 3 in 2” — | 
moves, and that this cannot be done in fewer than 2" — | moves. 


P27 


2G. 
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University B. once boasted 17 tenured professors of mathematics. ‘Tradi- 
tion prescribed that at their weekly luncheon meeting, faithfully attended by 
all 17, any members who had discovered an error in their published work 
should make an announcement of this fact, and promptly resign. Such an an- 
nouncement had never actually been made, because no professor was aware 
of any errors in her or his work. This is not to say that no errors existed, 
however. In fact, over the years, in the work of every member of the de- 
partment at least one error had been found, by some other member of the 
department. ‘This error had been mentioned to all other members of the 
departnent, but the actual author of the error had been kept ignorant of the 
fact, to forestall any resignations. 

One fateful year, the department was augmented by a visitor from another 
university, one Prof. X, who had come with hopes of being offered a perma- 
nent position at the end of the academic year. Naturally, he was apprised, by 
various members of the department, of the published errors which had been 
discovered. When the hoped-for appointment failed to materialize, Prof. X 
obtained his revenge at the last luncheon of the year. “Thave enjoyed my visit 
here very much,” he said, “but I feel that there is one thing that I have to tell 
you. At least one of you has published an incorrect result, which has been 
discovered by others in the department.” What happened the next year? 


After figuring out, or looking up, the answer to Problem 27, consider the fol- 
lowing: Each member of the department already knew what Prof. X asserted, 
so how could his saying it change anything? 
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PART 9) 


FOUNDATIONS 


The statement is so frequently made 
that the differential calculus deals with 
continuous magnitude, and yet 

an explanation of this continuaty is 
nowhere given; 

even the most rigorous expositions 

of the differential calculus do not base 
their proofs upon continuity but, 

with more or less consciousness of the fact, 
they either appeal to geometric notions 
or those suggested by geometry, 

or depend upon theorems which are never 
established in a purely arithmetic manner. 
Among these, for example, 

belongs the above-mentioned theorem, 
and a more careful investigation 
convinced me that this theorem, or 

any one equivalent to it, can be regarded 
in some way as a sufficient basis 

for infinitesimal analysis. 

It then only remained to discover its true 
origin in the elements of arithmetic 

and thus at the same time 

to secure a real definition of 

the essence of continuity. 

I succeeded Nov. 24, 1858, and 

a few days afterward I communicated 
the results 

of my meditations to my dear friend 
Durége with whom I had a long 

and hively discussion. 


RICHARD DEDEKIND 


CHAPTER 


PROVISIONAL DEFINITION 


FUNCTIONS 


Undoubtedly the most important concept in all of mathematics is that of a 
function in almost every branch of modern mathematics functions turn out to 
be the central objects of investigation. It will therefore probably not surprise you 
to learn that the concept of a function is one of great generality. Perhaps it will 
be a relief to learn that, for the present, we will be able to restrict our attention to 
functions of a very special kind; even this small class of functions will exhibit sufh- 
cient variety to engage our attention for quite some time. We will not even begin 
with a proper definition. For the moment a provisional definition will enable us to 
discuss functions at length, and will illustrate the intuitive notion of functions, as 
understood by mathematicians. Later, we will consider and discuss the advantages 
of the modern mathematical definition. Let us therefore begin with the following: 


A function 1s a rule which assigns, to each of certam real numbers, some other real 
number. 


The followmg examples of functions are meant to illustrate and amplify this defi- 
nition, which, admittedly, requires some such clarification. 


Example 1 The rule which assigns to each number the square of that number. 


Example 2. ‘The rule which assigns to each number y the number 


yer oy 
yal 


Example 3 The rule which assigns to each number c 4 |, —1 the number 


o+3c+5 


al 


Example 4 ‘The rule which assigns to each number x satisfying —17 < x < 2/3 
the number x?. 

Example 5 The rule which assigns to each number a the number 0 if @ is 
irrational, and the number | if @ is rational. 


Example 6 ‘The rule which assigns 


to 2 the number 5, 


36 
to 17 the number —, 
rh 
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513 
to 7 the number 28, 


to oC the number 28, 
1 
and to any y # 2, 17, 27/17, or 36/z, the number 16 if y is of the form a+ b/2 
fora,binQ. 

Example 7 ‘The rule which assigns to each number ¢ the number ¢* +x. (This 
rule depends, of course, on what the number x is, so we are really describing 
infinitely many different functions, one for each number x.) 

Example 8 ‘Vhe rule which assigns to each number < the number of 7’s in the 
decimal expansion of z, if this number is finite, and —z if there are infinitely many 
7’s in the decimal expansion of <. 


One thing should be abundantly clear from these examples —a function is ay 
rule that assigns numbers to certain other numbers, not just a rule which can 
be expressed by an algebraic formula, or even by one uniform condition which 
applies to every number; nor 1s it necessarily a rule which you, or anybody else, 
can actually apply in practice (no one knows, for example, what rule 8 associates 
to wz). Moreover, the rule may neglect some numbers and it may not even be clear 
to which numbers the function applies (try to determine, for example, whether the 
function in Example 6 applies to 7). ‘The set of numbers to which a function does 
apply is called the domain of the function. 

Before saying anything else about functions we badly need some notation. Since 
throughout this book we shall frequently be talking about functions (indeed we shall 
hardly ever talk about anything else) we need a convenient way of naming func- 
tions, and of referring to functions in general. The standard practice is to denote 
a function by a letter. For obvious reasons the letter “f~ is a favorite, thereby 
“e” and “h” other obvious candidates, but any letter (or any reasonable 
x” andy 


making 


ee pes 


symbol, for that matter) will do, not excluding , although these letters 
are usually reserved for indicating numbers. If f is a function, then the number 
which f associates to a number x 1s denoted by f(x) this symbol is read “f of 
x” and is often called the value of f at x. Naturally, if we denote a function by x, 
some other letter must be chosen to denote the number (a perfectly legitimate, 
though perverse, choice would be “f,” leading to the symbol x(f)). Note that the 
symbol f(x) makes sense only for x m the domain of f; for other x the symbol 
f(x) is not defined. 

If the functions defined in Examples 1-8 are denoted by f,,2, 4, r, 5, 0, @x, 


and y, then we can rewrite their definitions as follows: 


(1) f(x) =" for all x! 
34 3y 45 
(2) g(y)= yee for all y. 
yr-+] 
3 
~4+3e¢45 | 
SkG).= a forall c # 1, 1, 
mw 
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(4) r(x) =x? for all x such that -17 < x < 7/3. 


(5), sO | QO, x irrational 


l_* & rational: 


a YS = 2 
36 
—, x=17 
IE 
x2 
36 
28, ————— 
IT 


4 


It 
ie ato 
Le 17 


ee FG) = r+x forall numbers fr. 


36 
Or —eand «aw + bV2 for a, b in Q. 
4 


Pee n, exactly n 7’s appear in the decimal expansion of x 
1 — wee : 
; —z, infinitely many 7’s appear in the decimal expansion of x. 


These definitions illustrate the common procedure adopted for defining a func- 
tion f —indicating what f (+) is for every number x in the domain of f. (Notice 
that this is exactly the same as indicating f(a) for every number a, or f(b) for ev- 
ery number J, etc.) In practice, certain abbreviations are tolerated. Definition (1) 
could be written simple 


Ciyme f(x) ex" 


the qualifying phrase “for all x” being understood. Of course, for definition (4) 
the only possible abbreviation is 


(4) r(x) =x, mile be 6 
It is usually understood that a definition such as 
] | 
AX ee ————— | 
x x«—l 
can be shortened to 


l l 
AX) = = ai 


ie | 


in other words, wiless the domain is explicitly restricted further, it rs understood to consist of 
all numbers for which the definition makes any sense at all. 


You should have little difficulty checking the following assertions about the fune- 
tions defined above: 


FOr) = fee 42a ; 
69 ),"Cople ca tae De bs 
raxat)l=rw)+2x4+ lif -lI7<x«< = ey | 
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s(x + y) = s(x) if y is rational: 


aes (2): 
17 x) 


a(xy=x-[f) +1]; 


If the expression f(s (@)) looks unreasonable to you, then you are forgetting that 
s(a) is a number hke any other number, so that f(s(a@)) makes sense. As a matter 
of fact, f(s(a)) = s(a) for all a. Why? Even more complicated expressions than 
f (s(a)) are, afier a first exposure, no more difficult to unravel. ‘The expression 


f (r(s(O(a3(¥(5)))))). 
formidable as it appears, may be evaluated quite easily with a little patience: 


f(r(s(8(a3(9G)))))) 
= f (r(s(6(a3(0))))) 
ty (rs (7(3)))) 
= (sli) 
aaa ly) 
fQ) 
=|. 


The first few problems at the end of this chapter give further practice manipulating 
this symbolism. 

The function defined in (1) is a rather special example of an extremely impor- 
tant class of functions, the polynomial functions. A function f is a polynomial 
function if there are real numbers do, ... , d, such that 


1 


= 3) 
f(X) = ayx" + yy x" +s Fagx” + ayx + a9, for all x 


(when f(x) is written in this form it is usually tacitly assumed that a, 4 0). The 
highest power of x with a nonzero coefficient is called the degree of f; for 
example, the polynomial function f defined by f(x) = 5x° + 1374 — has 
degree 6. 

‘The functions defined in (2) and (3) belong to a somewhat larger class of func- 
tions, the rational functions; these are the functions of the form p/q where p 
and q are polynomial functions (and g is not the function which is always 0). ‘The 
rational functions are themselves quite special examples of an even larger class of 
functions, very thoroughly studied in calculus, which are simpler than many of the 
functions first mentioned in this chapter. ‘Vhe following are examples of this kind 
of function: 

yy ty? tex sin? x 
Or ar (ny = [a 
X1SMIW El A SIN” 
(10) f(x) = sin(x?). 
(11) f(x) = sin(sin(x?)). 
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(12) f(x) = sin? (sin(sin?(x sin? x?))) « sin (aa). 


x - sins 


By what criterion, you may feel impelled to ask, can such functions, especially a 
monstrosity like (12), be considered simple? The answer is that they can be built 
up from a few simple functions using a few simple means of combining functions. 
In order to construct the functions (9)-(12) we need to start with the “identity 
function” J, for which (x) = x, and the “sine function” sin, whose value sin(x) at 
x is often written simple sin.x. The following are some of the important ways in 
which functions may be combined to produce new functions. 

If f and g are any two functions, we can define a new function f + g, called 
the sum of f and g, by the equation 


Of Seo) eo ar ere. 


Note that according to the conventions we have adopted, the domain of f + g 
consists of all x for which “ f(x) + g(x)” makes sense, 1.e., the set of all x in both 
domam f and domain g. If A and B are any two sets, then AM B (read “A 
intersect B” or “the intersection of A and B”) denotes the set of x m both A 
and B; this notation allows us to write domain(f + g) = domain f MN domain g. 


In a similar vein, we define the product f - g and the quotient f (or f/g) of 
§ 
f and g by 
Ci seven! — f(s) s) 


f) pa 3) 
is a g(x) 


Moreover, if g is a function and c is a number, we define a new function c- g by 


and 


(e 9g) (x) =.c- g(x). 


This becomes a special case of the notation f - g if we agree that the symbol c 
should also represent the function f defined by f(x) = c; such a function, which 
has the same value for all numbers x, is called a constant function. 

The domain of f - g 1s domain f M domain g, and the domain of c- g is simply 
the domain of g. On the other hand, the domain of f/g is rather complicated— it 
may be written domain f 1 domain gM {x : g(x) 4 O}, the symbol {x : g(x) 4 0} 
denoting the set of numbers x such that g(x) 4 0. In general, {x : ...} denotes 
the set of all x such that “ ...” is true. Thus {x : x3 +3 < 11} denotes the set of 
all numbers x such that x? < 8, and consequently {x :.x7+3 < 11} = {x:x <2}. 
Either of these symbols could just as well have been written using y everywhere 
iustead of x. Variations of this notation are common, but hardly require any 
discussion. Any one can guess that {x > 0: x3 < 8} denotes the set of positive 
numbers whose cube is less than 8; it could be expressed more formally as {x : 
x > Oand x? < 8}. Incidentally, this set is equal to the set {x :0 <x < 2}. One 
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variation 1s slightly less transparent, but very standard. The set {1,3,2,4}, for 
example, contains just the four numbers 1, 2, 3, and 4; it can also be denoted by 
lol = 1 or. =shor t= 2 or @= 4). 

Certain facts about the sum, product, and quotient of functions are obvious con- 
sequences of facts about sums, products, and quotients of numbers. For example, 
it is very easy to prove that 


Up ae We) ae ee ae WP ae 


The proof is characteristic of almost every proof which demonstrates that two 
functions are equal —the two functions must be shown to have the same domain, 
and the same value at any number in the domaim. For example, to prove that 
(f+g)+th= f+(g+h), note that unraveling the definition of the two sides gives 


((f+tgdth|@)=(f + e)(x) tha) 


[ f(x) + g(x)] +h) 


and 


[ft(g+h](x) = fix) + (gta) 
=f) le) tk@)); 


and the equality of [ f(x) + g(x)| +A(x) and f(x) + [g(v) + 4(x)] 1s a fact about 
numbers. In this proof the equality of the two domains was not explicitly men- 
tioned because this is obvious, as soon as we begin to write down these equations; 
the domain of (f +g) +h and of f + (g +h) is clearly domain f NM domain g N 
domain. We naturally write f +g +4 for (f +g) +4 = f +(g +A), precisely 
as we did for numbers. 

It is Just as easy to prove that (f-g)-h = f-(g-/), and this function 1s denoted 
by f-g-h. The equations f+ ¢=g+f and f-g =g- f should also present 
no difficulty. 

Using the operations +, -, / we can now express the funcuon f defined in (9) 
by 

 J+i-l+T-sm-sin 


sin] - sin -sm 


It should be clear, however, that we cannot express function (10) this way. We re- 
quire yet another way of combining functions. ‘This combination, the composition 
of two functions, is by far the most important. 

If f and g are any two functions, we define a new function fo g, the compo- 
sition of f and g, by 


C7 ce) =) (er); 


the domain of fog is {x sx is in domain g and g(x) is in domain f}. ‘Vhe symbol 
“fog” is often read “f circle g.” Compared to the phrase “the coniposition off 
and g” this has the advantage of brevity, of course, but there is another advantage 
of far greater import: there is much less chance of confusing fo g with go f, aud 
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these must not be confused, since they are not usually equal; in fact, almost any f 
and g chosen at random will illustrate this point (try f = /-/ and g = sin, for 
example). Lest you become too apprehensive about the operation of composition, 
let us hasten to point out that composition 7s associative: 


(fog)oh= fo(goh) 


(and the proof is a triviality); this function 1s denoted by fo g oh. We can now 
write the functions (10), (11), (12) as 

10); sine (a 

(i* 7) sires c(t ry 

(12) f = (sin- sin) osin o (sin- sin) o (/ - [(sin- sin) o (J - /)]) - 


; I +sino (/ - sin) 
sin o {| ————————_— ]. 
7+ sin 


One fact has probably already become clear. Although this method of writing 
functions reveals their “structure” very clearly, it is hardly short or convenient. ‘The 
shortest name for the function f such that f(x) = sin(x?) for all x unfortunately 
seems to be “the function f such that f(x) = sin(x7) for all x.” The need for 
abbreviating this clumsy description has been clear for two hundred years, but no 
reasonable abbreviation has received universal acclaim. At present the strongest 
contender for this honor is something like 


. 9) 
x => sin(x*) 


(read “x goes to sin(x*)” or just “x arrow sin(x?)”), but it is hardly popular among 
writers of calculus textbooks. In this book we will tolerate a certain amount of 
ellipsis, and speak of “the function f(x) = sin(x?).” Even more popular is the 
quite drastic abbreviation: “the function sin(x?).” For the sake of precision we 
will never use this description, which, strictly speaking, confuses a number and 
a function, but it is so convenient that you will probably end up adopting it for 
personal use. As with any convention, utility is the motivating factor, and this 
criterion is reasonable so long as the slight logical deficiencies cause no confusion. 
On occasion, confusion wl arise unless a more precise description is used. For 
example, “the function x +” is an ambiguous phrase; it could mean either 


aeappttet ve. the funetion / such slau f(x) wet t? for all x 


or 


t—> x+r>,ie., the function f such that f(t) =x +P? for all r. 


As we shall see, however, for many mmportant concepts associated with functions, 
calculus has a notation which contains the “x +” built in. 

By now we have made a sufhiciently extensive investigation of functions to war- 
rant reconsidering our definition. We have defined a function as a “rule,” but it is 
hardly clear what this means. If we ask “What happens if you break this rule?” it 
Is not easy to say whether this question is merely facetious or actually profound. 
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A more substantial objection to the use of the word “rule” is that 


f@) = me 
and 
f(x) x24 3x +3 —3@ 41) 


are certainly different rules, if by a rule we mean the actual instructions given for 
determiming f(x); nevertheless, we want 


fia x 
and 


f@Q)= Pee eRe Ce 1) 


to define the same function. For this reason, a function is sometimes defined as an 
“association” between numbers; unfortunately the word “association” escapes the 
objections raised against “rule” only because it is even more vague. 

There is, of course, a satisfactory way of defining functions, or we should never 
have gone to the trouble of criticizing our original definition. But a satisfactory 
definition can never be constructed by finding synonyms for English words which 
are troublesome. ‘The definition which mathematicians have finally accepted for 
“function” is a beautiful example of the means by which intuitive ideas have been 
incorporated into rigorous mathematics. The correct question to ask about a 
function is not “What is a rule?” or “What is an association?” but “What does 
one have to know about a function in order to know all about it?” ‘The answer to 
the last question is easy-- for each number x one needs to know the number / (x); 
we can imagine a table which would display all the information one could desire 
about the function f(x) = x?: 


x ee) 

l 

—| l 
2 4 
a 4 
2 


It is not even necessary to arrange the numbers in a table (which would actually 
be impossible if we wanted to list all of them). Instead of a two column array we 
can consider various pairs of numbers 


(i. i), la). Cet) —2, 4), er), 2, 2) 


DEFINITION 


DEFINITION 
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simply collected together into a set.* ‘To find f(1) we simply take the second 
number of the pair whose first member is 1; to find f(z) we take the second 
number of the pair whose first member is 7. We seem to be saying that a function 
might as well be defined as a collection of pairs of numbers. For example, if we 
were given the following collection (which contains just 5 pairs): 


i = heli] ) (Byes Sedo (AS), MSA YE 
then (l= ff (Oy Oo) = os a — 0, ee) = nd Ie oe, So. Slane tne 
only numbers in the domain of f. If we consider the collection 

PS 0G Dis 2, 5) ,8), (8,4), 


then f(3) = 7, f(2) =5, f(8) = 4; but it is impossible to decide whether f(1) = 7 
or f(1) = 8. In other words, a function cannot be defined to be any old collection 
of pairs of numbers; we must rule out the possibility which arose in this case. We 
are therefore led to the following definition. 


A function is a collection of pairs of numbers with the following property: if 


(a, b) and.(a,c),are both, m, the¢-collegtion,.then.b =.c; mother avords, the 
collection must not contain two different pairs with the same first element. 


This is our first full-fledged definition, and illustrates the format we shall always 
use to define significant new concepts. ‘These definitions are so important (at 
least as important as theorems) that it is essential to know when one is actually 
at hand, and to distinguish them from comments, motivating remarks, and casual 
explanations. ‘Vhey will be preceded by the word DEFINITION, contain the term 
being defined in boldface letters, and constitute a paragraph unto themselves. 

There is one more definition (actually defining two things at once) which can 
now be made rigorously: 


If f is a function, the domain of / is the set of all a for which there is some b 
such that (a,b) isin f. If ais in the domamm of f, it follows from the definition 


of a function that there is, in fact, a unique number Bb such that (a,b) is in f. 
This unique b is denoted by f(a). 


With this defmition we have reached our goal: the important thing about a 
function f is that a number f(x) is determined for each number x in its domain. 
You may feel that we have also reached the point where an intuitive definition has 
been replaced by an abstraction with which the mind can hardly grapple. ‘Two 
consolations may be offered. First, although a function has been defined as a 


* The pairs occurring here are often called “ordered pairs,” to emphasize that, for example, (2, 4) is 
not the same pair as (4, 2). It is only fair to warn that we are going to define functions im terms of 
ordered pairs, another undefined term. Ordered pairs can be defined, however, and an appendix to 
this chapter has been provided for skeptics. 
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collection of pairs, there is nothing to stop you from thinking of a function as a 
rule. Second, neither the intuitive nor the formal definition indicates the best way 
of thinking about functions. ‘Vhe best way is to draw picturcs; but this requires a 
chapter all by itself. 


PROBLEMS 
Le plicten() — H/C )) VWhtatis 
GQ)  f(f(x)) (for which x does this make sense?). 


i) f (=). 


(in) f(cx). 
(i) ay 
) f@)+ fQ). 
(vi) For which numbers c is there a number x such that f(cx) = f(x). 
Hint: ‘Vhere are a lot more than you might think at first glance. 
(vu) For which numbers c is it true that f(cx) = f(x) for two different 
numbers x? 


2. Let g(x) =x’, and let 


0, x rational 
|, <x tational. 


h(x) = 


(i) For which y is h(y) < y? 
(i) For which y is h(y) < g(y)? 
(iil) What is g(h(z)) — A(z)? 

( 

( 


iv) For which w is g(w) < w? 
v) For which ¢ is g(g(€)) = g(e)? 


3. Find the domain of the functions defined by the following formulas. 
(i) f Gieeay | — x2. 


(i) f@) =V1—- V1 —2?. 


be l ] 
(111) Jee | he 


Gv) =f) =v | — Be eee Sa 
(Vv) fGj= V/l—x+ Vee 


). F . . ~ ~ . 
4. LetS@y=x*, let P@y= 2, and let s(x) sine, Haid Rach of the tollowine: 
In each case you answer should be a number. 


(1), goon@:Seeuries) (@y ). 

(ii) »(S.os)(y). 

(it) (So Pos)(t)+(so P)(t). 

(iv) s(t). 

Express each of the following functions mn terms of S, P, s, using only 
+, +, and o (for example, the answer to (1) is Pos). In each case your 
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answer should be a function. 


(i 
(ii 
(ii) 
i 

( 


Iv) 


v) 


(v1) 


(vil) 


(viii) 


{@ =. Qsin x 
7 Cosi 2 
f(x) =sinx?. 
Seep Se as . . . . . 
f(x) = sin® x (remember that sin x is an abbreviation for (sin x)?). 
f(t) = 27. (Note: a” always means a®?; this convention is adopted 


because (a”)° can be written more simply as a.) 


f@).=sn@* + 2"), 
{y= sin (sin (sin (2? yD: 


f(a) —- Qsin? ete sin(a7) ts Qsin(a* +sina)_ 


asin y 


Polynomial functions, because they are simple, yet flexible, occupy a favored 


role in most investigations of functions. The following two problems illustrate their 


flexibility, and guide you through a derivation of their most important elementary 


properties. 


6. 


(a) 


(b) 


If xy, ..., X, are distinct numbers, find a polynomial function fj; of 
degree u — | which ts 1 at x; and O at x; for j Ai. Hint: the product of 
all (x — x;) for j Ai, is O at x; if j Ai. (This product is usually denoted 


by 
n 

[[@ = 2s 

j=l 

IE | 
the symbol M1 (capital pi) playing the same role for products that & plays 
for sums.) 
Now find a polynomial function f of degree n — 1 such that f(x;) = aj, 
where aj, ..., G are given numbers. (You should use the functions 
fi from part (a). The formula you will obtain is called the “Lagrange 
interpolation formula.”) 


Prove that for ary polynomial function f, and any number a, there is a 
polynomial function g, and a number J, such that f(x) = (x —a) g(x) +b 
for all x. (The idea is simply to divide (x ~ a) mto f (x) by long division, 
until a constant remainder is left. For example, the calculation 


x? 44x —2 
x—l )x3 —3x+1 
3 2 
ie eg— Xe 
x? —3x 
2) 
Sa —Xx 
—2x+1 
—2x+2 
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shows that x° =3x<t 1 = i(e=_al Gr? 4+ x — 2) 21 eAnformal proof is 
possible by induction on the degree of f.) 

(b) Prove that if f(a) = 0, then f(x) = (w — a)g(x) for some polynomial 
function g. (The converse is obvious.) 

(c) Prove that if f is a polynomial function of degree n, then f has at most 
n roots, 1.e., there are at most nm numbers a with f(a) = 0. 

(d) Show that for each n there is a polynomial function of degree n with 
n roots. If nis even find a polynomial function of degree n with no 
roots, and if n is odd find one with only one root. 


8. For which numbers a, b, c, and d will the function 


ax +b 
= cx +d 


satisfy f(f(x)) =x for all x (for which this equation makes sense)? 


9. (a) If A is any set of real numbers, define a function Cy, as follows: 


Pe an A 


Cre — : 
AW) Of notin Ae 


Find expressions for Cang and Caug and Cp_a, in terms of C4 and Cz. 
(The symbol AQ B was defined in this chapter, but the other two may 
be new to you. ‘They can be defined as follows: 


ATS — {a4 1s mn -A or % is mop) 
R— A = {x : x is in R but x 1s not in A}.) 


(b) Suppose f is a function such that f(x) = 0 or | for each x. Prove that 
hicne issasset A Suchthat 7 — (4. 
(c) Show that f = f? ifand only if f = C, for some set A. 
10. (a) For which functions f is there a function g such that f = g-? Hint: You 
can certainly answer this question if “function” is replaced by “number.” 
(b) For which functions f is there a function g such that f = 1/g? 
*(c) For which functions b and ¢ can we find a function x such that 


(x(t)? + D(x (t) + c(t) = 0 


for all numbers ¢ ? 
*(d) What conditions must the functions a and b satisfy if there is to be a 
function x such that 


a(t)x(t) + b(t) =0 
for all numbers 1? How mauy such functions x will there be? 


11. (a) Suppose that H is a function and y is a number such that H(H(y)) = y. 
What ts 
HH (H+ + (H@)---)2 


80 tines 


We 


i 


at, 


15. 


*16, 
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(b) Same question if 80 is replaced by 81. 
(c) Same question if H(H(y)) = H(y). 

*(d) Find a function H such that H(H(x)) = H(x) for all numbers x, and 
such tadted.i— 300 (2) = 7/3, 15) = BioOr Oe E53) = 
m/3, H(47) = 47. (Don’t try to “solve” for H(x); there are many func- 
tions H with H(H(x)) = H(x). The extra conditions on H are supposed 
to suggest a way of finding a suitable H.) 

*(e) Find a function H such that H(A (x)) = H(x) for all x, and such that 
Hl) = ara ike 


A function f is even if f(x) = f(—x) and odd if f(x) = —f(—x). for 
example, f is even if f(x) = x? or f(x) = |x| or f(x) = cosx, while f is 


odd il {U-) = xsor ge) =x. 


(a) Determine whether f + g is even, odd, or not necessarily either, in the 
four cases obtained by choosing f even or odd, and g even or odd. (Your 
answers can most conveniently be displayed in a 2 x 2 table.) 

(b) Do the same for f - g. 

(c) Do the same for f og. 

(d) Prove that every even function f can be written f(x) = g(|x|), for in- 
finitely many functions g. 


(a) Prove that any function f with domain R can be written f = E + O, 
where E is even and O is odd. 

(b) Prove that this way of writing f is unique. (If you try to do part (b) first, 
by “solving” for E and O you will probably find the solution to part (a).) 


If f is any function, define a new function |f| by |f|(x) = |f@)|. If f 
and g are functions, define two new functions, max(f, g) and min(f, g), by 


INA) 2). ) = ilaK( (x), e(4.)), 
Tin 7,2) Ce) — maint Ac), olr)). 


Find an expression for max(f, g) and min(f, g) in terms of | |. 


(a) Show that f = max(f,0) + min(f,0). This particular way of writing 
f is fairly useful; the functions max(f. 0) and min(f,0) are called the 
positive and negative parts of f. 

(b) A function f is called nonnegative if f(x) > 0 for all x. Prove that any 
function f can be written f = g — h, where g and hh are nonnegative, 
in infinitely many ways. (The “standard way” is g = max(f, 0) and h = 
—min(f, 0).) Hint: Any nwnber can certainly be written as the difference 
of two nonnegative numbers in infinitely many ways. 


Suppose f satisfies f(x + y) = f(x) + f() for all x and y. 


(ay "Prove that f(xy + ---+an) = f@1r +::-+ fn). 

(b) Prove that there is some number c¢ such that f(v) = cx for all rational 
numbers x (at this point we're not trying to say anything about f(x) for 
rational x). Hint: First figure out what c must be. Now prove that 
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f(x) = ex, first when x is a natural number, then when x is an integer, 
then when x is the reciprocal of an integer and, finally, for all rational x. 


If f(x) =0 for all x, then f satisfies f(x + y) = f(x) + f(y) for all x and y, 
and also f(x-y) = f(x)- f() for all x and y. Now suppose that f satisfies 
these two properties, but that f(x) is not always 0. Prove that f(x) = x for 
all x, as follows: 


Prove that’ f.(l\) =. 

Prove that f(x) =~ if x is rational. 

Prove that f(x) > Oif x > O. (This part is tricky, but if you have 
been paying attention to the philosophical remarks accompanying the 
problems in the last two chapters, you will know what to do.) 

Prove that f()'= "7 nl = ay. 

Prove that f(x) = x for all x. Hint: Use the fact that between any two 
numbers there is a rational number. 


Precisely what conditions must f, g, 4, and k satisfy in order that f(x)g(y) = 
h(x)k(y) for all x and y? 


(a) 


Prove that there do not exist functions f and g with either of the followmg 
properties: 


G) f(x) + 8(y) =<xy for all x and y. 
(iy a ety) = x -entor all eanday. 


Hint: ‘Try to get some information about f or g by choosing particular 
values of x and y. 
Find functions f and g such that f(x + y) = g(xy) for all x and y. 


Find a function f, other than a constant function, such that | f(y) — 
OD =p! — x. 

wlppose that f(y) — f(x) = (yy — 4) foralieaandiy, (Why does this 
imply that | f(y) — f(x)| < (y—x)*?) Prove that f is a constant function. 
Hint: Divide the interval from x to y into n equal pieces. 


Prove or give a counterexample for each of the following assertions: 


(a) 
(b) 


(c) 


joe Ci) = | of ey Ode 
(g+hyoy=goftho f. 


Suppose g =o f. Prove thatif f(@*)= f(y), then g@) = e()). 
Conversely, suppose that f and g are two functions such that g(v) = g(y) 
whenever f(x) = f(y). Prove that g =/o f for some function . Hint: 


Just try to define h(z) when z is of the form z = f(x) (these are the only ¢ 


that matter) and use the hypotheses to show that your definition will not 


run mito trouble. 
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Suppose that fo g = J, where /(x) = x. Prove that 


a) Meee y SUC hee X zie 


b) every number b can be written b = f(a) for some number a. 

a) Suppose g is a function with the property that g(x) # g(y) if x F y. 
Prove that there is a function f such that fog =/. 

b) Suppose that f is a function such that every number b can be written 


b = f(a) for some number a. Prove that there is a function g such that 
fog=l. 


Find a function f such that go f = / for some g, but such that there is no 
function A with f oh =T. 


Suppose f o go= JpandyRic f=... Prove that g = h. Hint: Use the facet that 
composition is associative. 


(a) Suppose f(x) = x+1. Are there any functions g such that fog = go f? 

(b) Suppose f is a constant function. For which functions g does fo g = 
Scie? 

(c) Suppose that fog = g° f for all functions g. Show that f is the identity 
faniction, 7 (27 —ee. 


(a) Let F be the set of all functions whose domain is R. Prove that, using + 
and - as defined in this chapter, all of properties P1- P9 except P7 hold 
for F, provided 0 and | are interpreted as constant functions. 

(b) Show that P7 does not hold. 

*(c) Show that PIO-P12 cannot hold. In other words, show that there is 
no collection P of functions in F, such that PIO-P12 hold for P. (It is 
sufficient, and will simplify things, to consider only functions which are 0 
except at two points x9 and xy.) 

(d) Suppose we define f < g to mean that f(x) < g(x) for all x. Which of 
P’10—P’13 (in Problem 1-8) now hold? 

(ce) gli af <= Sigh eleeeiiece? els f Op go}? 
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APPENDIX. ORDERED PAIRS 


Not only in the definition of a function, but in other parts of the book as well, 
it is necessary to use the notion of an ordered pair of objects. A definition has 
not yet been given, and we have never even stated explicitly what properties an 
ordered pair is supposed to have. ‘The one property which we will require states 
formally that the ordered pair (a,b) should be determined by a and b, and the 
order in which they are given: 


lita, ») = (c,d), thane = c andk — a. 


Ordered pairs may be treated most conveniently by simply introducing (a, b) 
as an undefined term and adopting the basic property as an axiom—since this 
property is the only significant fact about ordered pairs, there is not much point 
worrying about what an ordered pair “really” is. ‘Those who find this treatment 
satisfactory need read no further. 

The rest of this short appendix is for the benefit of those readers who will feel 
uncomfortable unless ordered pairs are somehow defined so that this basic property 
becomes a theorem. ‘There is no point in restricting our attention to ordered pairs 
of numbers; it is just as reasonable, and just as important, to have available the 
notion of an ordered pair of any two mathematical objects. ‘This means that our 
definition ought to involve only concepts common to all branches of mathematics. 
The one common concept which pervades all areas of mathematics is that of a 
set, and ordered pairs (like everything else in mathematics) can be defined in this 
context; an ordered pair will turn out to be a set of a rather special sort. 

The set {a, b}, containmg the two elements a and 4, is an obvious first choice, 
but will not do as a definition for (a,b), because there is no way of determining 
from {a,b} which of a or b is meant to be the first element. A more promising 
candidate is the rather starthng set: 


{el aul Sia late 


This set has two members, both of which are themselves sets; one member is the set 
{a}, containing the single member a, the other is the set {a, b}. Shocking as it may 
seem, we are going to define (a, b) to be this set. The justification for this choice 1s 
given by the theorem immediately following the definition- the definition works, 
and there really isn’t anything else worth saying. 


(a, b) = tla}, [azo 


If (a,b) = (c,d), then a =c and b=d. 


‘The hypothesis means that 
{a}, {a,b} } = Cle}, fe. d}}. 


Now {{a}, {a,b}} contains just two members, {a} and {a,b}; and a is the only 
common element of these two members of { {a}, {a,b} }. Similarly, ¢ is the unique 


3, Appendix. Ordered Pairs 55 


common member of both members of { {c}, {c.d}}. Therefore a = c. We there- 
fore have 
{{a}, {a,b}} = { {a}. {a,d}}, 


and only the proof that b = d remains. It is convenient to distinguish 2 cases. 


Case 1. b =a. In this case, {a, b} = {a}, so the set { {a}, {a, b}} really has only one 
member, namely, {a}. The same must be true of { {a}, {a,d}}, so {a,d} = {a}, 
which implies that d = a = b. 


Case 2. b # a. In this case, b is in one member of { {a}, {a,b}} but not in the 
other. It must therefore be true that b is in one member of { {a}, {a,d}} but not 
in the other. This can happen only if b is in {a, d}, but 6 is not in {a}; thus b =a 
or b=d,buth #4a;sob=d. J 
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Mention the real numbers to a mathematician and the image ofa straight Ime will 
probably form in her mind, quite involuntarily. And most likely she will neither 
banish nor too eagerly embrace this mental picture of the real numbers. “Geomet- 
ric intuition” will allow her to mterpret statements about numbers in terms of this 
picture, and may even suggest methods of proving them. Although the properties 
of the real numbers which were studied in Part I are not greatly illuminated by a 
geometric picture, such an interpretation will be a great aid in Part II. 

You are probably already famihar with the conventional method of considering 
the straight line as a picture of the real numbers, 1.e., of associating to each real 
number a point on a line. ‘To do this (Figure 1) we pick, arbitrarily, a pomt which 
we label 0, and a point to the right, which we label 1. ‘The point twice as far to 
the right is labeled 2, the point the same distance from 0 to 1, but to the left of 0, 
is labeled —1, ete. With this arrangement, if a < b, then the point corresponding 
to a hes to the left of the point corresponding to b. We can also draw rational 
numbers, such as 5, in the obvious way. It is usually taken for granted that the 
irrational numbers also somehow fit into this scheme, so that every real number 
can be drawn as a point on the line. We will not make too much fuss about 


justifying this assumption, since this method of “drawing” numbers is intended 


solely as a method of picturing certain abstract ideas, and our proofs will never 
rely on these pictures (although we will frequently use a picture to suggest or help 
explam a proof). Because this geometric picture plays such a prominent, albeit 
inessential role, geometric terminology 1s frequently employed when speaking of 
numbers thus a number is sometimes called a pomt, and R is often called the 
real line. 

The number |a—)| has a simple interpretation im terms of this geometric picture: 
it is the distance between a and b, the length of the line segment which has a as one 
end pomt and as the other. This means, to choose an example whose frequent 
occurrence justifies special consideration, that the set of numbers x which satisfy 
|x — a| < € may be pictured as the collection of pomts whose distance from a is 
less than e. This set of points is the “interval” from a — € to a+ €, which may also 
be described as the points corresponding to numbers x with a—€ <x <a+eé 
(Figure 2). 

Sets of numbers which correspond to intervals arise so frequently that it is desir- 
able to have special names for them. ‘The set {x :a@ <x < b} is denoted by (a, b) 
and called the open interval from a to b. This notation naturally creates some 
ambiguity, since (a, b) is also used to denote a pair of numbers, but in context it is 
always clear (or can easily be made clear) whether one is talking about a pair or 
an interval. Note that if a > b, then (a, b) = ¥, the set with no elements; in prac- 
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tice, however, it is almost always assumed (explicitly if one has been careful, and 
implicitly otherwise), that whcnever an interval (a, b) is mentioned, the number a 
is less than Db. 


The set {v : a < x < b} is denoted by [a, b| and is called the closed interval 
from a to b. This symbol ts usually reserved for the case a < b, but it is sometimes 
used for a = b, also. The usual pictures for the intervals (a, b) and [a, b| are shown 
in Figure 3; since no reasonably accurate picture could ever indicate the difference 
between the two intervals, various conventions have been adopted. Figure 3 also 
shows certain “infinite” mtervals. The set {x : x > a} is denoted by (a, oo), 
while the set {x 5 = a} 1s denoted by [ayoo); the sets (—c6)a) and (— ee, a| are 
defined similarly. At this pomt a standard warning must be issued: the symbols oo 
and —oo, though usually read “infinity” and “minus infinity,” are purely suggestive; 
there is no number “oo” which satisfies oo > a for all numbers a. While the 
symbols oo and —oo will appear in many contexts, it is always necessary to define 
these uses in ways that refer only to numbers. The set R of all real numbers is 
also considered to be an “interval,” and is sometimes denoted by (—oo, 00). 


Of even greater interest to us than the method of drawing numbers is a method 
of drawing pairs of numbers. ‘This procedure, probably also familiar to you, re- 
quires a “coordinate system,” two straight lies mtersecting at right angles. To 
distinguish these straight lines, we call one the horizontal axis, and one the vertical 
axis. (More prosaic terminology, such as the “first” and “second” axes, is probably 
preferable from a logical poimt of view, but most people hold their books, or at 
least their blackboards, in the same way, so that “horizontal” and “vertical” are 
more descriptive.) Each of the two axes could be labeled with real numbers, but 
we can also label points on the horizontal axis with pairs (a, 0) and points on the 
vertical axis with pairs (0, >), so that the intersection of the two axes, the “origin” 
of the coordinate system, 1s labeled (0,0). Any pair (a,b) can now be drawn as 
in Figure 4, lying at the vertex of the rectangle whose other three vertices are la- 
beled (0, 0), (a, 0), and (0, 6). The numbers a and b are called the first and second 
coordinates, respectively, of the point determined in this way. 


Our real concern, let us recall, is a method of drawing functions. Since a func- 
tion is just a collection of pairs of numbers, we can draw a function by drawing 
each of the pairs in the function. The drawing obtained in this way is called the 
graph of the function. In other words, the graph of f contains all the points cor- 
responding to pairs (x, f(x)). Smee most functions contain infinitely many pairs, 
drawing the graph promises to be a laborious undertaking, but, in fact, many 
functions have graphs which are quite easy to draw. 


Not surprisingly, the simplest functions of all, the constant functions f(x) = c¢, 
have the simplest graphs. It is easy to see that the graph of the function f(x) = ¢ 
is a straight line parallel to the horizontal axis, at distance c from it (Figure 5). 


The functions f(x) = cx also have particularly simple graphs straight lines 
through (0,0), as in Figure 6. A proof of this fact is mdicated in Figure 7: 
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Let x be some number not equal to 0, and let L be the straight line which passes 
through the origin O, corresponding to (0,0), and through the pomt A, corre- 
sponding to (x, cx). A point A’, with first coordmate y, will lie on L when the 
triangle A’B’O is similar to the triangle ABO, thus when 


AiBem AB”Y 
OB’ OB 


(Cs 


this is precisely the condition that A’ corresponds to the pair (y, cy), Le., that A’ 
lies on the graph of f. The argument has implicitly assumed that c > 0, but the 
other cases are treated easily enough. The number c, which measures the ratio of 
the sides of the triangles appearing in the proof, is called the slope of the straight 
line, and a line parallel to this line is also said to have slope c. 

This demonstration has neither been labeled nor treated as a formal proof. 
Indeed, a rigorous demonstration would necessitate a digression which we are 
not at all prepared to follow. The rigorous proof of any statement connecting 
geometric and algebraic concepts would first require a real proof (or a precisely 
stated assumption) that the poits on a straight line correspond im an exact way 
to the real numbers. Aside from this, it would be necessary to develop plane 
geometry as precisely as we intend to develop the properties of real numbers. 
Now the detailed development of plane geometry is a beautiful subject, but it is by 
no means a prerequisite for the study of calculus. We shall use geometric pictures 
only as an aid to intuition; for our purposes (and for most of mathematics) it is 
perfectly satisfactory to define the plane to be the set of all pairs of real numbers, 
and to define straight lines as certain collections of pairs, including, among others, 
the collections {(x, cx): x a real number}. To provide this artificially constructed 
geometry with all the structure of geometry studied in high school, one more 
definition is required. If (a,b) and (c,d) are two points in the plane, 1.¢., pairs of 
real numbers, we define the distance between (a,b) and (c, d) to be 


JV (a —c)? + (b—d)?. 


If the motivation for this definition is not clear, Figure 8 should serve as adequate 
explanation—with this definition the Pythagorean theorem has been built into our 
geometry.* 

Reverting once more to our informal geometric picture, it is not hard to see 
(Figure 9) that the graph of the function f(~) = cx + d is a straight line 
with slope c, passing through the point (0.d). For this reason, the functions 
f(x) = cx +d are called linear functions. Simple as they are, Imear func- 
tions occur frequently, and you should feel comfortable working with them. The 
following is a typical problem whose solution should not cause any trouble, Given 
two distinct points (a,b) and (c,d), find the hnear function f whose graph goes 
through (a,b) and (c,d). This amounts to saying that f(a) = b and f(c) =d. If 


*Vhe fastidious reader might object to this definition on the grounds that nonnegative numbers 
are not yet known to have square roots. ‘Phis objection is really unanswerable at the moment the 
definition wall just have to be accepted with reservations, until this ttle pomt is settled. 
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f is to be of the form f(x) = ax + f, then we must have 


Gas P= Db, 
ac+B=d: 
therefore a = (d — b)/(c — a) and B = b — [(d — b)/(c — a) |a, so 
d—b d—b d—b 
ioe x+b— a= (x —a) +b, 
c—a c—a c-a 


a formula most easily remembered by using the “pomt-slope form” (see Problem 6). 

Of course, this solution is possible only if a 4 c; the graphs of linear functions 
account only for the straight lines which are not parallel to the vertical axis. ‘The 
vertical straight lines are not the graph of any function at all; in fact, the graph ofa 
function can never contain even two distinct pomts on the same vertical line. This 
conclusion is immediate from the definition ofa function — two points on the same 
vertical line correspond to pairs of the form (a, b) and (a, c) and, by definition, a 
function cannot contain (a, b) and (a,c) if b 4c. Conversely, if a set of points m 
the plane has the property that no two points lie on the same vertical line, then 
it is surely the graph ofa function. Thus, the first two sets in Figure 10 are not 
graphs of functions and the last two are; notice that the fourth is the graph of a 
function whose domain is not all of R, since some vertical lines have no points on 
them at all. 

After the linear functions the simplest is perhaps the function f(x) = x. If we 
draw some of the pairs in f, ie., some of the pairs of the form (x, x*), we obtain 
a picture like Figure I]. 
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It is not hard to convince yourself that all the pairs (x, x7) lie along a curve like 
the one shown in Figure 12; this curve is known as a parabola. 

Since a graph is just a drawing on paper, made (in this case) with printer’s ink, 
the question “Ts this what the graph really looks like?” is hard to phrase in any 
sensible manner. No drawing 1s ever really correct smce the line has thickness. 
Nevertheless, there are some questions which one can ask: for example, how can 
you be sure that the graph does not look lke one of the drawings in Figure 13? 
It is easy to see, and even to prove, that the graph cannot look lke (a); for if 
0 <x < y, then x* < y?, so the graph should be higher at y than at x, which is 
not the case in (a) . It 1s also easy to see, simply by drawing a very accurate graph, 
first plotting many pairs (x, x7), that the graph cannot have a large “jump” as in (b) 
or a “corner” as m (c). In order to prove these assertions, however, we first need 
to say, in a mathematical way, what it means for a function not to have a “jump” 
or “corner”; these ideas already involve some of the fundamental concepts of 
calculus. Eventually we will be able to define them rigorously, but meanwhile you 
may amuse yourself by attempting to define these concepts, and then examining 
your definitions critically. Later these definitions may be compared with the ones 
mathematicians have agreed upon. If they compare favorably, you are certainly 
to be congratulated! 

The functions f(x) = x", for various natural numbers n, are sometimes called 
power functions. Their graphs are most easily compared as in Figure 14, by 
drawing several at once. 

The power functions are only special cases of polynomial functions, mtroduced 
in the previous chapter. ‘Two particular polynomial functions are graphed in 
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Figure 15, while Figure 16 is meant to give a general idea of the graph of the 
polynomial function 


f(x) aa IN a Pa mp a (elie 


in the case a, > 0. 

In general, the graph of f will have at most 2 — 1 “peaks” or “valleys” (a “peak” 
is a point like (x, f(x)) in Figure 16, while a “valley” ts a pomt like (y, f(y)). The 
number of peaks and valleys may actually be much smaller (the power functions, 
for example, have at most one valley). Although these assertions are easy to make, 
we will not even contemplate giving proofs until Part III (once the powerful meth- 
ods of Part III are available, the proofs will be very easy). 

Figure 17 illustrates the graphs of several rational functions. ‘The rational func- 
tions exhibit even greater variety than the polynomial functions, but their behavior 
will also be easy to analyze once we can use the derivative, the basic tool of Part ITI. 

Many interesting graphs can be constructed by “piecing together” the graphs of 
functions already studied. The graph in Figure 18 is made up entirely of straight 
lines. The function f with this graph satisfies 
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and is a linear function on each interval [1/(n +1), 1/n] and [—1/n, -1/(n4+1)]. 
(The number 0 is not in the domain of f.) Of course, one can write out an explicit 
formula for f(x), when x is in [1/(2 + 1), 1/n]; this is a good exercise in the use 
of linear functions, and will also convince you that a picture is worth a thousand 
words. 


n even n odd 
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It is actually possible to define, in a much simpler way, a function which exhibits 
this same property of oscillating infinitely often near 0, by using the sine function, 
which we will discuss in detail in Chapter 15. As usual, we are using radian 
measure, so an angle of 27 means an angle “all the way around” a circle, an 
angle of z an angle half way around (or 180° in layman’s terms), an angle of 2/2 
a right angle, ete. 

The graph of the sine function is shown in Figure 19. 


(x)'=sin x 


FIGURE 19 


Now consider the function f(x) = sin 1/x. The graph of f is shown in Fig- 
ure 20. ‘To draw this graph it helps to first observe that 


fey=0 fi heed 
x)= rx= -—, —, — >... 
: : x 2n 3x 
< l | | 
LCD ore, 5 i FOR 
5 50 +27 570 +47 
; . l l l 
f):=—1 for x = =—, =. = —_——__........ 


3 3 3 
sm 5a +2n 52 +47 


Notice that when x 1s large, so that 1/x is small, f(x) is also small; when x 1s 


f(x) =sin- 
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FIGURE 20 
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“large negative,” that is, when || is large for negative x, again f(x) is close to 0 
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although f(x) < 0. 
An interesting modification of this function is f(x) = x sin1/x. The graph of 


this function is sketched in Figure 21. Since sin 1/x oscillates infinitely often near 0 
between | and —1, the function f(x) = x sin 1/x oscillates infinitely often between 
x and —x. The behavior of the graph for x large or large negative is harder to 
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analyze. Since sin |/x is getting close to 0, while x is getting larger and larger, there 
seems to be no telling what the product will do. It 7s possible to decide, but this 1s 
another question that is best deferred to Part III. The graph of f(x) = x? sin 1 /x 
has also been illustrated (Figure 22). 

For these infinitely oscillating functions, it is clear that the graph cannot hope to 
be really “accurate.” The best we can do is to show part of it, and leave out the 
part near O (which is the interesting part). Actually, it is easy to find much simpler 
functions whose graphs cannot be “accurately” drawn. The graphs of 
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can only be distinguished by some convention similar to that used for open and 
closed intervals (Figure 23). 
Out last example 1s a function whose graph is spectacularly nondrawable: 
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FIGURE 24 


The graph of f must contain infinitely many points on the horizontal axis and 
also mfinitely many points on a line parallel to the horizontal axis, but it must not 
contain either of these lines entirely. Figure 24 shows the usual textbook picture 
of the graph. ‘To distinguish the two parts of the graph, the dots are placed closer 
together on the line corresponding to irrational x. (There is actually a mathemat- 
ical reason behind this convention, but it depends on some sophisticated ideas, 
introduced in Problems 21-5 and 21-6.) 

The peculiarities exhibited by some functions are so engrossing that it is easy 
to forget some of the simplest, and most important, subsets of the plane, which 
are not the graphs of functions. The most important example of all is the circle. 
A circle with center (a, b) and radius r > 0 contains, by definition, all the points 
(x, y) whose distance from (a, b) is equal to r. The circle thus consists (Figure 25) 
of all pomts (x, y) with 


V(x -—a)*+(y-—bP =r 


or 
(x - a)? (y- by =r. 
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The circle with center (0,0) and radius 1, often regarded as a sort of standard copy, 
is called the wnt cucle. 

A close relative of the circle is the ellipse. This is defined as the set of points, 
the swum of whose distances from two “focus” points is a constant. (When the two 
foci are the same, we obtain a circle.) If, for convenience, the focus points are 
taken to be (—c, 0) and (c, 0), and the sum of the distances is taken to be 2a (the 
factor 2 simplifies some algebra), then (x, y) is on the ellipse if and only if 


Vv (x — (—e))? + y? + v(x —c)?+ y? = 2a 


or 
9 9 9 9 
V(x +c) + y° = 2a— V(x -—c)° + y° 
or 
Y 9 9 2 7 9 ; 9 9 
x° + 2ex +c° + y* = 4a~ —4a Vv (x —c)° + y* + x*° — 2ex +c" + y~ 
or 
7 9 9 
A(cx — a“) = —4a Vv (x —c)” + y* 
or 
7 9 ») ju 2 ? >) 
c7x? — 2cxa? +.a* = a(x? — 2cx +c? + y?) 
or 
9 2) 9 > 2 9 2; 9 
(ec —47)x- =4-y> =f (Cc =a") 
or 
. 
2 y? 
= ten Di | 
ae at =e 
This is usually written simply 
9 9 
t= . en 
=ztaoH i, 
ao) be 
9 9 . . . 
where b = Va->—c> (since we must clearly choose a > c, it follows that 


9 9 . . . . . ye , . . 
a~ —c~ > 0). A picture of an ellipse is shown in Figure 26. The ellipse inter- 
sects the horizontal axis when y = 0, so that 


5 
x2 
z= | x= +a, 
ay 
2 2 
(Cesyne Sco | 
(O, b) ne age b2 
(—a, 0) (a, 0) 
(O, —b) 


FIGURE 26 


FIGURE 27 
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and it intersects the vertical axis when x = 0, so that 


0) 
y~ 


—= = | Nit set 
Da 

The hyperbola is defined analogously, except that we require the difference of 
the two distances to be constant. Choosing the points (—c,0) and (c,0) once 
again, and the constant difference as 2a, we obtain, as the condition that (x, y) 


be on the hyperbola, 


oi Gece) ict y? a ee) 


which may be simplified to 


a 
Ae 

. ) 
In this case, however, we must clearly choose c > a, so that a* — Cx ta 0, eglt 


b = Vc? — a?, then (x, y) is on the hyperbola if and only if 


a2 bt 
The picture is shown in Figure 27. It contains two pieces, because the difference 
between the distances of (x, y) from (—c, 0) and (c,0) may be taken in two dif- 
ferent orders. The hyperbola intersects the horizontal axis when y = 0, so that 
x = +a, but it never intersects the vertical axis. 

It is interesting to compare (Figure 28) the hyperbola with a = b = V2 and 
the graph of the funcuon f(x) = 1/x. The drawings look quite similar, and 
the two sets are actually identical, except for a rotation through an angle of 2/4 
(Problem 23). 

Clearly no rotation of the plane will change circles or ellipses mto the graphs of 
functions. Nevertheless, the study of these important geometric figures can often 
be reduced to the study of functions. Ellipses, for example, are made up of the 


Ga Ro 28 
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graphs of two functions, 


fw=bvl-—- (x2 ar). —m=< x <a 
g(x) = —-bv1l—- (7 /a"y qx = a. 


Of course, there are many other pairs of functions with this same property. For 


and 


example, we can take 


BN l= (x2 /a-): =i = 


f(x) = 
—bvV1l— (Ja). —a<x <0 
and 
(x) =) Ga) (axa 
g(x) = 


bel = Ga =e =k <0, 
We could also choose 


2) B} 4 
bv 1 — (x*/a*), x rational -—a<x <a 


oo 
—bvl— Gao). x irrational, -a <x <a 
and 
—bvl-—- (x7 fa’), x rational —a<x<a 
g(x) = 


9 9 6 ti 
bv 1 —(x*/a*), x irrational.) <x. <a. 


But all these other pairs necessarily involve unreasonable functions which jump 
around. A proof, or even a precise statement of this fact, is too difficult at present. 
Although you have probably already begun to make a distinction between those 
functions with reasonable graphs, and those with unreasonable graphs, you may 
find it very difficult to state a reasonable definition of reasonable functions. A 
mathematical definition of this concept is by no means easy, and a great deal of this 
book may be viewed as successive attempts to impose more and more conditions 
that a “reasonable” function must satisfy. As we define some of these conditions, 
we will take time out to ask if we have really succeeded in isolating the functions 
which deserve to be called reasonable. ‘The answer, unfortunately, will always be 
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no,” or at best, a qualified “yes.” 


PROBLEMS 


1. Indicate on a straight line the set of all x satisfying the following conditions. 
Also name each set, using the notation for intervals (in some cases you will 
also need the U sign). 


@) kh = She ih 
Gi) Iv 3) = 6 


(i) |x —al <eé. 
ie) Ble =) 2 i. 
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] 
I 
(v) a 
l+x2~> 
(vi) < a (give an answer in terms of a, distinguishing various cases). 
| as x2 ce ) te he 


(vii) a> +l? 
(vit) (x + 1) -— Ii — 2) > 0. 


There is a very useful way of describing the points of the closed interval [a, b] 
(where we assume, as usual, that a < b). 


(a) First consider the interval [0, b], for b > 0. Prove that if x is in [0, 5] 
then « = tb for some t with O < t < 1. What is the significance of the 
number ¢? What is the mid-point of the interval [0, b]? 


b) 


(b) Now prove that if x is in [a, b], then x = (1 — t)a + tb for some ¢ with 
QO <+ <1. Hint: This expression can also be written as a + t(b — a). 
What is the midpoint of the interval [a,b]? What is the point 1/3 of the 
way from a to b? 

(c) Prove, conversely, that if 0 <+¢ < 1, then (1 —f)a + 1b is mn [a, b]. 

(d) The points of the open interval (a,b) are those of the form (1 — t)a + tb 
for O87 P 


Draw the set of all points (x, y) satisfying the following conditions. (In most 
cases your picture will be a sizable portion ofa plane, not just a line or curve.) 


ee 

(i) xta>ytb. 
ii) cee 

iv) ey eae 


x= yh. 
vi) “neha al 
(vil) x + y is an integer. 
| 
tay 
(ix) («-—1?4+(y-2)? <1. 


Comme ee 


is an integer. 


Draw the set of all pots (x, y) satisfying the following conditions: 


()  ixl+ ple. 

Gi) Wx|-Iyl=. 

Gi) jx — 1] =|y— 1]. 
Gv) |l—x|=[y— 1]. 
(vy) x7+ y2=0, 
(wiheeetay a) 

(an) ae ya 4 
(vili) x* = y?. 
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(1, m) 


HC WIRSE, 2.9 


7) 


Draw the set of all points (x, y) satisfying the following conditions: 


a) 
ee 
y2 x2 
31 = a = tN. 
a b 
x=lyl. 
1) ee 


Hint: You already know the answers when x and y are interchanged. 


(a) 


(b) 


(b) 


Show that the straight line through (a, b) with slope m is the graph of the 
function f(x) = m(x —a)+b. This formula, known as the “point-slope 
form” is far more convenient than the equivalent expression f(x) = 
mx + (b—ima); it is immediately clear from the point-slope form that the 
slope is m, and that the value of f at ais b. 

For a 4 c, show that the straight ine through (a,b) and (c,d) is the 
graph of the function 


f@= 


1—b 
: (x —a)+b. 
c—a 


When are the graphs of f(x) = mx + b and g(x) = mx + D’ parallel 
straight lines? 


For any numbers A, B, and C, with A and B not both 0, show that the 
set of all (x. y) satisfying Ax + By + C = 01s a straight line (possibly a 
vertical one). Hint: First decide when a vertical straight line 1s described, 
Show conversely that every straight line, including vertical ones, can be 
described as the set of all (x, y) satisfying Ax + By + C = 0. 


Prove that the graphs of the functions 
F@)=tx.+ b, 
2 Nee eal Ch 


are perpendicular if mn = —1, by computing the squares of the lengths 
of the sides of the triangle in Figure 29. (Why is this special case, where 
the hnes mtersect at the origin, as good as the general case?) 
Prove that the two straight lines consisting of all (x, y) sausfying the con- 
ditions 
Ax+ By +C =U, 
A'x + B’y+C'=0, 


are perpendicular if and only if AA’ + BB’ = 0. 


Prove, using Problem 1-19, that 


Vv (xy + y1)- + (x9 + Po)? = ake + x? a yy? oe yor. 


10. 


Ue 


We 


13. 


EE 
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(b) Prove that 
(3 — 41)? +03 —y1)? S$ V2 — 1)? +. 02 — 1)? 


+ 


oy 9y 
(x3 — x2)" + (y3 — y2)°. 


Interpret this mequality geometrically (it is called the “triangle mequal- 
ity’). When does strict inequality hold? 


Sketch the graphs of the following functions, plotting enough points to get 
a good idea of the general appearance. (Part of the problem is to make 
a reasonable decision how many is “enough”; the queries posed below are 
meant to show that a little thought will often be more valuable than hundreds 
of individual points.) 


‘ l 
(i) f(x) =x + —. (What happens for x near 0, and for large x? Where 
a 
does the graph he in relation to the graph of the identify function? Why 


does it suffice to consider only positive x at first?) 


l 
fo) SO See, 


~ 


(iii) f(x) =x? + 


=e — 
to 


2D 


iG) —7 


cea 
to 


Describe the general features of the graph of f if 


(1) | of 1S een. 

(i) 7 as odd: 

(in) f is nonnegative. 

(iv) f(x) = f(x +a) for all x (a function with this property is called peri- 
odic, with period a. 


Graph the functions f(x) = %/x for m = 1, 2, 3, 4. (There is an easy way to 
do this, using Figure 14. Be sure to remember, however, that %/x means the 
positive mth root of x when m is even; you should also note that there will be 
an important difference between the graphs when m is even and when mz is 
odd.) 


(a) Graph f(x) = |x| and f(x) = x?. 

(5) Graph 7 Gc) — | sinx|"and 7) — sin? x, (There is an important difler- 
ence between the graphs, which we cannot yet even describe rigorously. 
See if you can discover what it is; part (a) is meant to be a clue.) 


Describe the graph of g in terms of the graph of f if 
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iD: 


16. 


V7. 


18. 


1) Bg) — lays: 

i) =6g(x) = f(x +c). (It is casy to make a mistake here.) 
ii Roe cf (xX). 
Iv) Bo) — f(Cv): 


(Distinguish! the casese —"O, c =>'0, ¢ <0) 


vy) g(x) = f(l/x). 
1) g(x) = f(x). 
Vee it= | fO)|. 


1) ey) = nan 7,0): 
x)= eC mim( f, 0) 
¥) eee) — miaxts, 1): 


a 
Le 
— 

=. 
— 

—— 


NN 
Ke ES 
— 
war) 


Draw the graph of f(x) = ax? + bx +. Hint: Use the methods of Prob- 
lem 1-18. 


Suppose that A and C are not both 0. Show that the set of all (x, y) satisfying 
Axe Bat Cue Dyk — 0 


is either a parabola, an ellipse, or an hyperbola (or a “degenerate case”: two 
lines [either intersecting or parallel], one ne, a point, or ¥). Hint: The 
case C = OQ is essentially Problem 15, and the case A = 0 is just a minor 
variant. Now consider separately the cases where A and B are both positive 
or negative, and where one 1s positive while the other is negative. When do 
we have a circle? 


The symbol [x] denotes the largest integer which is < x. Thus, [2.1] = [2] = 
2 and [—0.9] = [—1] = —I. Draw the graph of the following functions 
(they are all quite interesting, and several will reappear frequently in other 
problems). 


() fe) = [x]. 

ai) f @)rearx —"[*]: 

ii) f(x) = Ve [aT 

(iv) f(x) = [x] + vx — [4]. 


(v) feo= |=] 
X 

(vi) f "ny 
x 


Graph the following functions. 


(i) f(x) = {x}, where {x} is defined to be the distance from x to the nearest 
integer, 

(i) f(x) = {2x}. 

(iii) f(x) = {x} + 3{2x}. 

(rv)” (Gi) Ss}. 

(vy) f(x) = bx} + F120} + Gl4e}. 


FIGURE 30 
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Many functions may be described in terms of the decimal expansion of a num- 
ber. Although we will not be m a position to describe infinite decimals rigorously 
until Chapter 23, your intuitive notion of infinite decimals should suffice to carry 
you through the following problem, and others which occur before Chapter 23. 
‘There is one ambiguity about infinite decimals which must be eliminated: Every 
decimal ending in a string of 9’s is equal to another ending in a string of 0°s (e.g., 


1.23999... = 1.24000...). We will always use the one ending in 9’s. 


*19. 


Ale 


Zi 


az: 


Describe as best you can the graphs of the following functions (a complete 
picture is usually out of the question). 


(1) f(x) = the Ist number in the decimal expansion of x. 

(ii) = f(*) =the 2nd number in the decimal expansion of x. 

(iit) f(x) = the number of 7’s in the decimal expansion of x if this number 
is finite, and O otherwise. 

(iv) f(x) = 0 if the number of 7’s in the decimal expansion of x is finite, 
and 1 otherwise. 

(vy) f(*) = the number obtained by replacing all digits in the decimal 
expansion of x which come after the first 7 (if any) by 0. 

(vi) f(x) =O if 1 never appears in the decimal expansion of x, and n if 1 
first appears in the nth place. 


Let ee 
QO, x irrational 


Ny) stellt ; 
fy) - x= ua rational in lowest terms. 
q q 


(A number p/q is in lowest terms if p and q are integers with no common 
factor, and q > 0). Draw the graph of f as well as you can (don’t sprinkle 
points randomly on the paper; consider first the rational numbers with q = 2, 
then those with g = 3, etc.). 


(a) The points on the graph of f(x) = x? are the ones of the form (x, x”). 
Prove that each such point is equidistant from the point (0, ) and the 
graph of g(x) = —f. (See Figure 30.) 

(b) Given a horizontal line L, the graph of g(x) = y, anda point P = (a, B) 
not on L, so that y # f, show that the set of all points (x, y) equidistant 
from P and L is the graph of a function of the form f(x) = ax*+bx +e. 
What is this set if y = B? 


(a) Show that the square of the distance from (c,d) to (x,m-x) is 
x?(m? + 1)+.x«(-—2md —2c)+ oes 


Using Problem 1-18 to find the minimum of these numbers, show that 
the distance from (c,d) to the graph of f(x) = mx is 


lem — d|/Vm? +1. 


(b) Find the distance from (c,d) to the graph of f(x) = mx +b. (Reduce 
this case to part (a).) 
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*23. (a) Using Problem 22, show that the numbers x’ and _ y’ indicated in Fig- 
ure 31 are given by 


| 1 
c= —=* 


+—=y, 
i 
l l 
= ——=x+-ny. 
J2 v2 
(b) Show that the set of all (x, y) with (x'/V2 \eo (y'/V2)P = | is the same 


FIGURE 31 as the set of all (x, y) with xy = 1. 


length y’ 2 


FIGURE |} 


(vy + wy), v2 + wW2) 


FIGURE 2 


(v1, v2) 


(U7 ws) v2 


(v]) + wy) — YY 
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ABRENDI Xela on VECIORS 


Suppose that v is a point in the plane; in other words, v is a pair of numbers 
v = (vj, V2). 


For convenience, we will use this convention that subscripts indicate the first and 
second pairs of a point that has been described by a single letter. ‘Thus, if we 
mention the poimts w and z, it will be understood that w is the pair (wy), w2), 
while z is the pair (z, 22). 

Instead of the actual pair of numbers (v1, v2), we often picture v as an arrow 
from the origin O to this point (Figure 1), and we refer to these arrows as vectors 
in the plane. Of course, we’ve haven't really said anything new yet, we've simply 
introduced an alternate term for a point of the plane, and another mental picture. 
The real point of the new termimology is to emphasize that we are going to do 
some new things with points in the plane. 

For example, suppose that we have two vectors (i.e., points) in the plane, 


V = Vie) tw, w2). 
Then we can define a new vector (a new point of the plane) v + w by the equation 
(1) v+w=(v+ uw). 2+ U2). 


Notice that all the letters on the right side of this equation are numbers, and the 
+ sign is just our usual addition of numbers. On the other hand, the + sign on 
the left side is new: previously, the sum of two points in the plane wasn’t defined, 
and we’ve simply used equation (1) as a definition. 

A very fussy mathematician might want to use some new symbol for this newly 
defined operation, like 


U+tw, or perhaps v@w, 


but there’s really no need to insist on this; since v + w hasn’t been defined before, 
there’s no possibility of confusion, so we might as well keep the notation simple. 
Of course, any one can make new notation; for example, since it’s our definition, 
we could just as well have defined v + w as (vj + wy + w2. v2 + wy"), or by some 
other equally weird formula. The real question is, does our new construction have 
any particular significance? 
Figure 2 shows two vectors v and w, as well as the point 


(uy Pw aD wo), 


which, for the moment, we have simply indicated in the usual way, without drawing 
an arrow. Note that it is easy to compute the slope of the line L between v and 
our new point: as indicated in Figure 2, this slope is just 


(v2 + w2) — v2 w 


(Yytw)-—vy wy 


c) 


and this, of course, is the slope of our vector w, from the origin O to (w), w2). In 
other words, the line L is parallel to w. 
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FIGURE 3 


FIGURE 4 


FIGURE 5 


Similarly, the slope of the line M between (wy), w2) and our new point is 
y; p 1, W2 } 


(HIG w 2) = wo) > 


(v; + w)) — v2 v1 


which is the slope of the vector v; so M is parallel to v. In short, the new point 
v+w lies on the parallelogram having v and w as sides. When we draw v + w as 
an arrow (Figure 3), it points along the diagonal of this parallelogram. In physics, 
vectors are used to symbolize forces, and the sum of two vectors represents the 
resultant force when two different forces are applied simultaneously to the same 
object. 

Figure 4 shows another way of visualizing the sum v-+w. Ifwe use “w” to denote 
an arrow parallel to w, and having the same length, but starting at v instead of at 
the origin, then v + w is the vector from O to the final endpoint; thus we get to 
u+ w by first following v, and then following w. 

Many of the properties of + for ordinary numbers also hold for this new + for 
vectors. For example, the “commutative law” 


v+w=w-+u, 


is obvious from the geometric picture, since the parallelogram spanned by v and 
w is the same as the parallelogram spanned by w and v. It 1s also easily checked 
analytically, since it states that 


(vj + wi, v2 + w2) = (wy) + V1, w2 + V2), 
and thus simply depends on the commutative law for numbers: 


vip tw) =w,t+ vy, 
Ud WW? — Woe UD. 


Similarly, unraveling definitions, we find the “associative law” 
[os w]e = || Ute 


Figure 5 indicates a method of finding v + w + z. 
The origin O = (0, 0) is an “additive identity,” 


O+vz=v+O0=n, 


and if we define 


—v = (—v). —U2), 


then we also have 


Use (SU — 0 =a 
Naturally we can also define 
w—-v=w4+(—-v), 
exactly as with numbers; equivalently, 


w—v = (wy — Vy, w2 — U2). 
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Just as with numbers, our definition of w — v simply means that it satisfies 
v+(w—v)=w. 


“w — vp” Figure 6(a) shows v and an arrow “w — v” that is parallel to w — v but that starts 
v at the endpoint of v. As we established with Figure 4, the vector from the origin 
to the endpoint of this arrow is just v + (w — v) = w (Figure 6(b)). In other words, 
we can picture w—v geometrically as the arrow that goes from v to w (except that 
(a) it must then be moved back to the origin). 
‘There is also a way of multiplying a number by a vector: For a number a and 
a vector v = (v1, v2), we define 


a-v = (avy, av2) 


TS a (We sometimes simply write av instead of a - v; of course, it is then especially 
important to remember that v denotes a vector, rather than a number.) ‘The 
v vector a+ v points in the same direction as v when a > 0 and in the opposite 


direction when a < 0 (Figure 7). 
You can easily check the following formulas: 


(b) 
FIGURE 6 Gp) = (abjayv, 
ate—. 
037 = 0; 
—l-v=-—v. 


Notice that we have only defined a product of a number and a vector, we have 
not defined a way of ‘multiplying’ two vectors, to get another vector.* However, 
there are various ways of ‘multiplying’ vectors to get numbers, which are explored 
in the following problems. 


PROBLEMS 


1. Given a point v of the plane, let Rg(v) be the result of rotating v around the 
origin through an angle of 6 (Figure 8). ‘The aim of this problem is to obtain 


FIGURE 7 a formula for Rg, with minimal calculation. 
(a) Show that 


RoC, OP= (cas OF sim), [we should really write Rg((1, 0)), ete.] 
RAO, Ty = (sin @acos ¢). 


(b) Explain why we have 


Re(v + w) = Ro(v) + Re(w), 
Re(a-w) =a: Rai). 


(c) Now show that for any point (x, y) we have 
Re(x, y) = (x cos@ — ysind, xsin@ + ycos@). 
*Tf you jump to Chapter 25, you'll find that there is an important way of defining a product, but 


this is something very special for the plane—it doesn’t work for vectors in 3-space, for example, even 
FIGURE 8 though the other constructions do. 
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(d) Use this result to give another solution to Problem 4-23. 
Given v and w, we define the nwnber 
Dw) — Ue Ut 


this is often called the ‘dot product’ or ‘scalar product’ of v and w (‘scalar’ 
being a rather old-fashioned word for a number, as opposed to a vector). 


(a) Given v, find a vector w such that v-w = 0. Now describe the set of all 
such vectors w. 
(b) Show that 


vew=weov 
Dp (Qi oe) eth ea se 


and that 
a> wi“ Vy ew = - (a: wy. 


Notice that the last of these equations involves three products: the dot 
product + of two vectors; the product - of a number and a vector; and 
the ordinary product - of two numbers. 

(c) Show that v-v > 0, and that v- v = 0 only when v = O. Hence we can 
define the norm ||v|| as 


|| v || = /v0-v, 


which will be 0 only for v = O. What is the geometric interpretation of 
the norm? 
(d) Prove that 
|v + wll < [lull + [wll 


and that equality holds if and only if v = 0 or w = 0 or w =a - v for 
some number a > 0. 

(ec) Show that 

lu + wl? — |jv- wil? 


4 


(a) Let Re be rotation by an angle of @ (Problem 1). Show that 


Veh = 


Re(v) +> Re(w) =v-w. 


(b) Let e = (1,0) be the vector of length | pointing along the first axis, and 
let w = (cos@, sin @); this is a vector of length | that makes an angle of 6 
with the first axis (compare Problem 1), Calculate that 

Gl! — (eos 0: 
Conclude that in general 


Ve w = |p| - lw] -cosé, 


where 6 is the angle between v and w. 


FIGURE 9 


|| w || - sin O 


TG Rae LO 
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Given two vectors v and w, we'd expect to have a simple formula, involving 
the coordinates v1), v2, wy, w2, for the area of the parallelogram they span. 
Figure 9 indicates a strategy for finding such a formula: since the triangle 
with vertices w, A, v + w is congruent to the tangle OBv, we can reduce 
the problem to an easier one where one side of the parallelogram lies along 
the horizontal axis: 
(a) ‘The line L passes through v and 1s parallel to w, so has slope w2/wy. 
Conclude that the point B has coordinate 
(By Ue!) —— vey 


w 
and that the parallelogram therefore has area 
det(v, w) = vy w2 — wy v2. 
This formula, which defines the determinant det, certainly seems to be simple 


enough, but it can’t really be true that det(v, w) always gives the area. After 
all, we clearly have 


det(w, v) = — det(v, w), 


so sometimes det will be negative! Indeed, it is easy to see that our “deriva- 

tion” made all sorts of assumptions (that w2 was positive, that B had a positive 

coordinate, etc.) Nevertheless, it seems likely that det(v, w) is + the area; the 
next problem gives an independent proof. 

(a) If v points along the positive horizontal axis, show that det(v, w) is the 
area of the parallelogram spanned by v and w for w above the horizontal 
axis (w2 > O), and the negative of the area for w below this axis. 

(b) If Rg is rotation by an angle of 6 (Problem 1), show that 


det(Rev, Rew) = det(v, w). 


Conclude that det(v, w) is the area of the parallelogram spanned by 
v and w when the rotation from v to w is counterclockwise, and the 
negative of the area when it is clockwise. 


Show that 


det(v, w + z) = det(v, w) + det(v, z) 
det(v + w, z) = det(v, z) + det(w, z 


and that 
a det(v. w) = det(a - v, w) = det(v, a- w). 


Using the method of Problem 3, show that 
det(v, w) = |lv|] - ||w i] - sin @, 


which is also obvious from the geometric interpretation (Figure 10). 
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APPENDIX 2. THE CONIC SECTIONS 


Although we will be concerned almost exclusively with figures in the plane, 
defined formally as the set of all pairs of real numbers, in this Appendix we want 
to consider three-dimensional space, which we can describe in terms of triples of 
real numbers, using a “three-dimensional coordinate system,” consisting of three 
straight lines intersecting at right angles (Figure 1). Our horizontal and vertical axes 
now mutate to two axes in a horizontal plane, with the third axis perpendicular to 
both. 

One of the simplest subsets of this three-dimensional space is the (infinite) cone 
illustrated in Figure 2; this cone may be produced by rotating a “generating line,” 
of slope C say, around the third axis. 


FIGURE 2 
For any given first two coordinates x and _y, the point (x, y, 0) in the horizontal 
. 4) OM x . 
plane has distance Vx" + y~ from the origin, and thus 
/ . on >, a, 9 
(1) (ay) is Gn thegone 1 and only iz —=—=Cvx y= 


We can descend from these three-dimensional vistas to the more familiar two- 
dimensional one by asking what happens when we intersect this cone with some 
plane P (Figure 3). 


IEG WAT 


If the plane is parallel to the horizontal plane, there’s certainly no mystery-~ the 
intersection is just a circle. Otherwise, the plane P intersects the horizontal plane 


slope —C 


FIGURE 4 
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in a straight line. We can make things a lot simpler for ourselves if we rotate 
everything around the vertical axis so that this mtersection line points straight out 
from the plane of the paper, while the first axis is in the usual position that we 
are familiar with. ‘The plane P is thus viewed “straight on,” so that all we see 
(Figure 4) is its intersection L with the plane of the first and third axes; from this 
view-point the cone itself simply appears as two straight lines. 

If this line L happens to be vertical, consisting of all poimts (a, z) for some a, 
then equation (1) says that the intersection of the cone and the plane consists of 
all points (a, y, z) with 

2 — Cy? = C2a?, 


which is an hyperbola. 
Otherwise, in the plane of the first and third axes, the line L can be described 
as the collection of all pomts of the form 


(x,Mx+ B), 
where M is the slope of L. For an arbitrary point (x, y, z) it follows that 
(2) (x.y, Z) 1S instilte plane P if and only i, 7S iA, + B. 


Combining (1) and (2), we see that (x, y, z) is in the intersection of the cone and 
the plane if and only if 


(+) Mx+B=+Cvx? + y’. 


Now we have to choose coordinate axes in the plane P. We can choose L as the 
first axis, measuring distances from the intersection Q with the horizontal plane 
(Figure 5); for the second axis we just choose the line through Q parallel to our 
original second axis. If the first coordinate of a point in P with respect to these 
axes is x, then the first coordinate of this pomt with respect to the original axes 
can be written in the form 


ax + B 
for some @ and f. On the other hand, if the second coordinate of the point with 
respect to these axes is y, then y is also the second coordinate with respect to the 
original axes. 
Consequently, (*) says that the point lies on the intersection of the plane and the 
cone if and only if 


M(ax + p)+B=+CV(ax+)*+y’. 


Although this looks fairly complicated, after squaring we can write this as 
Cure — a*(M* — C?)x? + Ex+F=0 


for some FE and F that we won't bother writing out. 
Now Problem 4-16 indicates that this is either a parabola, an ellipse, or an 
hyperbola. In fact, looking a little more closely at the solution, we see that the 
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values of E and F are irrelevant: 


(1) If M = +C we obtain a parabola; 

(2) If C? > M? we obtain an ellipse; 

(3) If C? < M? we obtain an hyperbola. 

‘These analytic conditions are easy to interpret geometrically (Figure 6): 

(1) If our plane is parallel to one of the generating lines of the cone we obtain 
a parabola; 

(2) If our plane slopes less than the generating Ine of the cone (so that our 
intersection omits one half of the cone) we obtain an ellipse; 

(3) If our plane slopes more than the generating line of the cone we obtain an 
hyperbola. 


hg 


FIGURE 6 


In fact, the very names of these “conic sections” are related to this description. 
The word parabola comes from a Greek root meaning ‘alongside,’ the same root 
that appears in parable, not to mention paradigm, paradox, paragon, paragraph, 
paralegal, parallax, parallel, and even parachute. £//ipse comes from a Greck root 
meaning ‘defect, or omission, as in ellipsis (an omission, ... or the dots that in- 
dicate it). And hyperbola comes from a Greek root meaning ‘throwing beyond,’ or 
excess. With the currency of words like hyperactive, hypersensitive, and hyperven- 
tilate, not to mention hype, one can probably say, without risk of hyperbole, that 
this root is familiar to almost everyone.* 


PROBLEMS 


1. Consider a cylinder with a generator perpendicular to the horizontal plane 

(igure 7); the only requirement for a point (x, y,z) to lie on this cylinder is 

* Although the correspondence between these roots and the geometric picture correspond so beau- 

tifully, for the sake of dull accuracy it has to be reported that the Greeks originally apphed the words 
to describe features of certaim equations involving the conic sections. 
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that (x, y) hes on a circle: 
5) 3) 4h 
bee ae (Cae 


Show that the intersection of a plane with this cylinder can be described by 
an equation of the form 
9 2 9) 
(ax + B)° + y" =C°. 
What possibilities are there? 


In Figure 8, the sphere 5; has the same diameter as the cylinder, so that its 
equator Cy lies along the cylinder; it is also tangent to the plane P at F;. 
Similarly, the equator Cz of Sz lies along the cylinder, and 5S} is tangent to P 
at Fo. 


(a) Let z be any point on the intersection of P and the cylinder. Explain 
why the length of the line from z to Fyis equal to the length of the vertical 
fine & fron’ to’ Cye 

(b) By proving a similar fact for the length of the line from z to Fy, show that 
the distance from z to F, plus the distance from z to F> is a constant, so 
that the intersection is an ellipse, with foci Fy and Fy. 


Similarly, use Figure 9(a) to prove geometrically that the intersection of a 
plane and a cone is an ellipse when the plane intersects just one half of the 


cone. Similarly, use (b) to prove that the intersection is an hyperbola when 
the plane intersects both halves of the cone. 


FIGURE 9 
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APPENDIX 3. POLAR COORDINATES 


In this chapter we’ve been acting all along as if there’s only one way to label 
points in the plane with pairs of numbers. Actually, there are many different 
ways, each giving rise to a different “coordinate system.” The usual coordinates 
of a poit are called its cartesian coordinates, after the French mathematician 
and philosopher Rene Descartes (1596-1650), who first introduced the idea of 
coordinate systems. In many situations it Is more convenient to introduce polar 
coordinates, which are illustrated in Figure |. To the point P we assign the polar 
coordinates (7,9), where r is the distance from the origin O to P, and 6@ is the 
measure, in radians, of the angle between the horizontal axis and the line from 
O to P. This @ is not determined unambiguously. For example, points on the 
right side of the horizontal axis could have either 6 = 0 or 6 = 27; moreover, 6 
is completely ambiguous at the origin O. So it is necessary to exclude some ray 
through the origin if we want to assign a unique pair (7,@) to each point under 
consideration. 


On the other hand, there is no problem associating a unique point to any pair 
(r,@). In fact, it is possible (though not approved of by all) to associate a point 
to (7,8) when r < 0, according to the scheme indicated in Figure 2. Thus, it 
always makes sense to talk about “the point with polar coordinates (r,@),” (with 
or without the possibility of r < 0), even though there is some ambiguity when we 
talk about “the polar coordinates” of a given point. 


length r 


P is the point with polar coordinates (r, 91) 
and also the point with polar coordinates 
C7iGa)s 


LEIGH Wid So) Ba 


It is clear from Figure | (and Figure 2) that the pomt with polar coordinates 
(r,@) has cartesian coordinates (x, y) given by 


% =r cose, y=7 sino 


FIGURE 3 


(a) 


(b) 


FIGURE 5 
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Conversely, if a point has cartesian coordinates (x, y), then (any of) its polar co- 
ordinates (r, @) satisfy 


rotvx?+ y? 


iand = => it x 4 0) 
x 


Now suppose that f is a function. Then by the graph of f in polar co- 
ordinates we mean the collection of all pomts P with polar coordinates (r, @) 
satisfying r = f(@). In other words, the graph of f in polar coordinates is the 
collection of all points with polar coordinates (f(@), 6). No special significance 
should be attached to the fact that we are considering pairs (f(@), 0), with f (0) 
first, as opposed to pairs (x, f(x)) in the usual graph of /; it is purely a matter of 
convention that r is considered the first polar coordinate and @ is considered the 
second. 

The graph of f in polar coordinates is often described as “the graph of the 
equation r = f(@).” For example, suppose that f is a constant function, f(@) =a 
for all 6. The graph of the equation r = a is simply a circle with center O and 
radius a (Figure 3). This example illustrates, in a rather blatant way, that polar 
coordinates are likely to make things simpler in situations that involve symmetry 
with respect to the origin O. 

The graph of the equation r = @ is shown in Figure 4. ‘The solid line corresponds 
to all values of 6 > 0, while the dashed line corresponds to values of 6 < 0. 


_ 


FIGURE 4 Spiral of Archimedes 


As another example involving both positive and negative r, consider the graph of 
the equation r = cos @. Figure 5(a) shows the part that corresponds to 0 < @ < 2/2 
Figure 5(b) shows the part corresponding to 7/2 < 6 < 2; here r < 0. You can 
check that no new points are added for 6 > a or 6 < 0. It is easy to describe 
this same graph in terms of the cartesian coordinates of its points. Since the polar 
coordinates of any point on the graph satisfy 


ri} Gos GF 


86 Foundations 


and hence 


its cartesian coordinates satisfy the equation 
2 2 
x" +y =x 


which describes a circle (Problem 4-16). [Conversely, it is clear that if the cartesian 
coordinates of a point satisfy x? + y? =x, then it lies on the graph of the equation 
r = cos6.] 

Although we've now gotten a circle in two different ways, we might well be 
hesitant about trying to find the equation of an ellipse in polar coordinates. But 
it turns out that we can get a very nice equation if we choose one of the foci as 
the origin. Figure 6 shows an ellipse with one focus at O, with the sum of the 
distances of all points from O and the other focus f being 2a. We've chosen f to 
the left of O, with coordinates written as 


(—2ea,0). 


(We have 0 < ¢ < 1, since we must have 2a > distance from f to O),. 


(x, y) 


FIGURE 6 
The distance r from (x, y) to O is given by 
(1) rsx? t+ y?. 
By assumption, the distance from (x, y) to f is 2a — r, hence 
(2a —r)* = (x — [—2ea])* + y’, 
or 
(2) 4a* — 4ar +r? = x* + 4eax + 467a? + ye: 
Subtracting (1) from (2), and dividing by 4a, we get 


4 
a-r=é&x+€& a, 
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or 


9 
=a—éx —€&a 
9 
= (1 —eé~)a — ex, 
which we can write as 
: 5 
(3) r=A—ex, for A = (1 —e*)a. 


\ 


Substituting r cos@ for x, we have 


r=A-—ercos#, 
r(1+ecos@) = A, 


and thus 


A 
(4) r= ————_.. 
1 +ecosé 
In Chapter 4 we found that 
x2 y2 
(5) a 


is the equation in cartesian coordinates for an ellipse with 2a as the sum of the 
distances to the foci, but with the foci at (—c,0) and (c, 0), where 


Since the distance between the foci is 2c, when this ellipse is moved left by c 
units, so that the focus (c, 0) is now at the origin, we get the ellipse (4) when we 
take c = €a or € = c/a (with equation (3) determining A). Conversely, given 
the ellipse described by (4), for the corresponding equation (5) the value of a is 
determined by (3), 


and again using c = €a, we get 


b = Va? —c? = Va? — £2a2 =aVv1l—&2 = =. 
V1—e? 


Thus, we can obtain a and b, the lengths of the major and minor axes, immediately 
from ¢ and A. 


The number 


a a a 


the eccentneity of the ellipse, determines the “shape” of the ellipse (the ratio of the 
major and minor axes), while the number A determines its “size,” as shown by (4). 
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PROBLEMS 
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FIGURE 7 


If two pomts have polar coordinates (r;, 6)) and (rz, 62), show that the dis- 
tance d between them is given by 


d* = fie 4 ro” — 2r\r2 cos(O; — 42). 


What does this say geometrically? 
Describe the general features of the graph of f in polar coordinates if 


(ee isseven 
(ii) f is odd. 
(ii) (0) = f@+z). 


Sketch the graphs of the following equations. 


Cea c sind’. 

(ii) + =asec@. Hint: It is a very simple graph! 
(in) r = cos 20. Good luck on this one! 

vie COS 30. 

Cie.) 60s 20]. 

(je —slicos 30 |. 


Find equations for the cartesian coordinates of points on the graphs (i), (11) 
and (i) in Problem 3. 


Consider a hyperbola, where the difference of the distance between the two 
foci is the constant 2a, and choose one focus at O and the other at (—2ea, 0). 
(In this case, we must have ¢ > 1). Show that we obtain the exact same 
equation in polar coordinates 
A 
r= ——— 
1+ ecosé 


as we obtained for an ellipse. 


Consider the set of points (x, y) such that the distance (x, y) to O is equal to 

the distance from (x, y) to the line y = a (Figure 7). Show that the distance 

to the line is a — r cos 6, and conclude that the equation can be written 
a=r() eose 

Notice that this equation for a parabola is again of the same form as (4) 


/ 


Now, for any A and ¢, consider the graph in polar coordinates of the equa- 
tion (4), which implies (3). Show that the pomits satisfying this equation satisfy 


(l — e*) x? + x — A* — 2Aex. 


Using Problem 4-16, show that this is an ellipse for ¢ < 1, a parabola for 
€ = |, and a hyperbola for e > I. 
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8. (a) Sketch the graph of the cardioid r = 1 — sin @. 
(b) Show that it is also the graph of r = —1 — sin@. 
(c) Show that it can be described by the equation 


0) 9 3 9 
Xp YS Va" yy, 


and conclude that it can be described by the equation 
2 2 2 2 2 
(x°- yy + y) =x + y 
9. Sketch the graphs of the following equations. 
i) r=1—4siné. 


(i) r=1—2siné@. 
(iil) r=2+ cos. 


10. (a) Sketch the graph of the /emniscate 
9 9 
r= Ja™cos 28. 


(b) Find an equation for its cartesian coordinates. 
(c) Show that it is the collection of all points P in Figure 8 satisfying 
2; 
d\d> =a. 
(d) Make a guess about the shape of the curves formed by the set of all P 
S paSig ee ? ? 
satistying d}d2 = b, when b > a~.and when b <a>. 


FIGURE 8 


CHAPTER 


PROVISIONAL DEFINITION 


LIMITS 


The concept ofa limit is surely the most important, and probably the most difficult 
one in all of calculus. ‘The goal of this chapter is the definition of limits, but we 
are, once more, going to begin with a provisional definition; what we shall define 
is not the word “init” but the notion of a function approaching a limit. 


The function f approaches the limit / near a, if we can make f(x) as close as we 
like to / by requiring that x be sufficiently close to, but unequal to, a. 


Of the six functions graphed in Figure I, only the first three approach / at a. 
Notice that although g(a) is not defined, and /(a) is defined “the wrong way,” it 
is still true that g and fh approach / near a. This is because we explicitly ruled 
out, in our definition, the necessity of ever considering the value of the function 
at ait is only necessary that f(x) should be close to / for x close to a, but unequal 
toa. We are simply not interested in the value of f(a), or even in the question of 
whether f(a) is defined. 


fe f | J. (ae 7, 
__ re ao 
a a 


a 
— 
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FIGURE |] 


One convenient way of picturing the assertion that f approaches / near a is 
provided by a method of drawing functions that was not mentioned in Chapter 4. 
In this method, we draw two straight limes, each representing R, and arrows from 
a point x in one, to f(x) in the other. Figure 2 illustrates such a picture for two 
different functions. 


90) 


5. Limits 91 


c - —1 0 1 2 
| 
=2 —| 0 1 
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FIGURE 2 


Now consider a function f whose drawing looks like Figure 3. Suppose we ask 
that f(x) be close to /, say within the open interval B which has been drawn 
in Figure 3. ‘This can be guaranteed if we consider only the numbers x in the 
interval A of Figure 3. (In this diagram we have chosen the largest interval which 
will work; any smaller interval containing a could have been chosen instead.) Ifwe 
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FIGURE 3 FIGURE 4 


choose a smaller interval B’ (Figure 4) we will, usually, have to choose a smaller A’, 
but no matter how small we choose the open interval B, there is always supposed 
to be some open interval A which works. 

A similar pictorial interpretation is possible in terms of the graph of f, but in 
this case the interval B must be drawn on the vertical axis, and the set A on the 
horizontal axis. The fact that f(x) isin B when x is in A means that the part of the 
graph lying over A is contained in the region which is bounded by the horizontal 
lines through the end points of B; compare Figure 5(a), where a valid interval A 
has been chosen, with Figure 5(b), where A is too large. 


(a) 


FIGURE 5 
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FIGURE 6 


‘To take a specific simple example, let’s consider the function f(«) = 3x with 
a =5 (Figure 6). Presumably f should approach the limit 15 near 5—we ought 
to be able to get f(x) as close to 15 as we like if we require that x be sufficiently 
close to 5. ‘To be specific, suppose we want to make sure that 3x is within ai of 
15. ‘This means that we want to have 


| | 
ano 


10° 
which we can also write as 
| 
eS ee 
Top ae ~ 10 
‘To do this we just have to require that 
] ] 
mae = 
V0 


or simply |x —5| < 0: ‘There is nothing special about the number bb: It is just as 


easy to guarantee that |3x — 15| < Ww: simply require that |x — 5| < wi. In fact, 
if we take any positive number € we can make [3.x — 15| < € simply by requiring 
that |x — 5] < &/3. 

There’s also nothing special about the choice a = 5. It’s just as easy to see that 
f approaches the mit 3a at a for any a: To ensure that 


[3x —3a| <e 


we just have to require that 


a ee 
Xx a| < =. 
3 


Naturally, the same sort of argument works for the function f(«) = 3,000,000... 
We just have to be 1,000,000 tmes as careful, choosing |x — a| < ¢/3,000,000 in 
order to ensure that | f(x) — a| < e. 

The function, 66.) = x? is a little more interesting, Presumably, we should be 
able to show that f(x) approaches 9 near 3. This means that we need to show 
how to ensure the inequality 


Ix? —9\<e 


for any given positive number ¢ by requiring |x — 3] to be small enough. ‘The 
obvious first step 1s to write 


ie 9) =r — Sie et 31. 


which gives us the useful |x — 3] factor. Unlike the situation with the previous 
, which isn’t a convenient constant 


examples, however, the extra factor here is |x +3 
like 3 or 3,000,000. But the only crucial thing is to make sure that we can say 
something about how big |x + 3] is. So the first thing we'll do is to require that 
|x — 3] < 1. Once we've specified that |» —3] < 1, or 2 < x <4, we have 
5 <x+3 <7 and weve guaranteed that |x + 3| < 7. So we now have 


i” = Oar — 8] - eC 
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which shows that we have |x? — 9| < e€ for |x — 3| < ¢/7, provided that we've 
also required that |x — 3| < 1. Or, to make it look more official: we require that 
|x — 3| < min(e/7, 1). 

The inital specification |x — 3] < 1 was simply made for convenience. We 


could just as well have specified that |x — 3] < is orale — 3] 10) oy ain other 


convenient number. ‘To make sure you understand the reasoning in the previous 
paragraph, it is a good exercise to figure out how the argument would read if we 
chose |x — 3| < 10. 

Our argument to show that f approaches 9 near 3 will basically work to show 
that f approaches a? near a for any a, except that we need to worry a bit more 
about getting the proper inequality for |x + a|. We first require that |x — a| < 1, 
again with the expectation that this will ensure that |. + a] is not too large. In 
fact, Problem 1-12 shows that 


Ix| — la] < |x -—a| < 1, 


sO 


Ix|< 1+ al, 


and consequently 


lx +a] < |x| + la| < 2Ja|+ 1, 


so that we then have 


< |x —a|-(Qla|+ 1), 


which shows that we have |x*—a?| < ¢ for |x —a| < €/(2\a|+ 1), provided that we 
also have |x — a| < 1. Officially: we require that |x — a| < min(e/(2|a| + 1), 1). 

In contrast to this example, we'll now consider the function f(x) = 1/x (for 
x #0), and try to show that f approaches 1/3 near 3. This means that we need 
to show how to guarantee the mequality 


for any given positive number € by requiring |x — 3| to be small enough. We begin 
by writing 


giving us the nice factor |x —3 


, and even an extra } for good measure, along with 
the problem factor 1/|x|. In this case, we first need to make sure that |x| isn’t too 
small, so that 1/|x| won't be too large. 


We can first require that |x — 3| < 1, because this gives 2 < x < 4, so that 


ee 


I 
<< 
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possible values for x 


a a 
2 2 
a=) a>0O 
Taal 


(E) Ssesn = 
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I oh nea 
which not only tells us that — < a but also that x > 0, which is important in 
eee: 


] l 
order to conclude that ree We now have 
l ] 1 1 ] 
——-|= -=-—-|x-3 —(|x —3|, 
x q See | 2 si zal | 


which shows that we have |1/x — 1/3] < ¢ for |x — 3| < 6e, provided that we’ve 
also required that |x — 3] < 1. Or, to make it look official again: we require that 
[x — 3| < min(e, 1). 

If we instead wanted to show that f approaches —1/3 near —3, we would begin 
by stipulating that |x — (—3)| < 1, giving —4 < x < —2, once again implying that 
[1 /x| < 1/2, so that everything works as before. 

To show in general that f approaches 1/a near a for any a we proceed in 
basically the same way, except that, again, we have to be a little more careful 
in formulating our initial stipulation. It’s not good enough simply to require that 
x —a| should be less than 1, or any other particular number, because if a is close to 
0 this would allow values of x that are negative (not to mention the embarrassing 
possibility that x = 0, so that f(x) isn’t even defined)). 

The trick in this case is to first require that 


la| 
lx —al < a 


— 


in other words, we require that x be less than half as far from 0 as a (Figure 7). 
You should be able to check first that x 4 0 and that 1/|x| < 2/la|, and then work 
out the rest of the argument. 

With all the work required for these simple examples, you may have begun to 
quail at the prospect of tackling even more complicated functions. But that won't 
really be necessary, since we will eventually have some basic theorems that we can 
rely on. Instead of worrying about the unpleasant algebra that might be involved 
inefunctons hkes/ (7) = x? or Pi Cou= oe we ll turn our attention to some 
examples that might appear to be even more frightening. 

Consider first the function f(x) = xsin1/x (Figure 8). 
behavior of this function near 0 it is clear, at least intuitively, that f approaches 


Despite the erratic 


/ = 0 near a = 0 (remember that our provisional definition specifically exempts 
x =a from consideration, so it doesn’t matter that this function isn’t even defined 
at 0). We want to show that we can get f(x) = xsin 1/x as close to 0 as desired 
if we require that x be sufficiently close to 0, but 4 0. In other words, for any 
uumber € > 0, we want to show that we can ensure that 


lf) = 0] = 


< € 


x sin — 
Bs 
by requiring that |x| = |4 —O| is sufficiently small (but 4 0). But this is easy. Since 


<i opr alice = (0 


FIGURE 9 


FIGURE 10 
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we have 


< |x|, for all x 4 0, 


x sin — 


so we can make |x sin 1/x| < ¢ simply by requiring that |x| < e and ¥ 0. 


. ~ . - Die 7. . 
For the function f(x) = x- sin 1/x (Figure 9) it seems even clearer that f ap- 
proaches 0 near 0. If, for example, we want 


then we certainly need only require that |x| < i and x # QO, since this implies 


é 
that |x-| < wa and consequently 


5 ated 
x" sin — 
Xx 


] l 
—— < —. 
100.10 


< |x?| < 


(We could do even better, and allow |x| < 1/V10 and x ¥ O, but there is no 
particular virtue in being as economical as possible.) In general, if ¢ > 0, to 
ensure that 


we need only require that 
Ix|<e and x £0, 


provided that e < 1. If we are given an ¢ which is greater than | (it might be, even 
though it is “small” ¢’s which are of interest), then it does not suffice to require 
that |x| < e, but it certainly suffices to require that |x| < 1 and x 4 0. 


As a third example, consider the function f(x) = V/|x|sin 1/x (Figure 10). In 
order to make | |x| sin 1/x| < & we can require that 


Ix]}<e? and x0 


(the algebra is left to you). 
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Finally, let us consider the function f(+) = sin 1/x (Figure 11). For this function 
it 1s false that f approaches 0 near 0. This amounts to saying that it is not true 
for every number ¢ > 0 that we can get | f(x) — O| < ¢ by choosmg x sufficiently 
small, and 4 0. ‘To show this we simply have to find one ¢ > O for which the 
condition | f(x«) — O| < ¢ cannot be guaranteed, no matter how small we require 
|wistotbe. elm fact. ¢ = 5 will do: it is impossible to ensure that | f(x)| < 1 no 


> 
matter how small we require |x| to be; for if A is any interval containing 0, there 


is some number x = 1/(5 + 2nz) which is in this interval, and for this x we have 


f@S2 


nil 


| 
(i 
I 
| 
| 
| 
' 
| 
| 
a | 
Ee = 
tlm 
I 
I 
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I 
if 
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if 
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FIGURE I] 


This same argument can be used (Figure 12) to show that f does not approach 
any number near 0. ‘To show this we must again find, for any particular number J, 
some number € > O so that | f(«) —/| < € is not true, no matter how small x is 
required to be. The choice ¢ = 5 works for any number /; that is, no matter how 
small we require |x| to be, we cannot ensure that | f(x) —/| < 5. The reason ts, 
that for any mterval A containing 0 we can find both x; and x2 in this interval 


with 
FIGURE 12 fi) =1 and f@o2) =—-1, 
namely 
| ] 
x)= 5 aud Oe 
sm + 2n0 sx + 2mm 


for large enough 2 and m. But the interval from / — to (ee t cannot contam 
both —1 and I, since its total length is only 1; so we cannot have 


ite cre) MM ple 


ll —l| < 


tol — 


no matter what / is. 
The phenomenon exhibited by f (4) = sin 1/x near 0 cau occur m many ways. 
If we consider the function 
OQ, x irrational 


To) — 


). x Yational 


TGR S13 


coe x > () 


ya 1,0a< 0 
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then, no matter what a is, f does not approach any number / near a. In fact, we 
cannot make | f(x) —/| < no matter how close we bring x to a, because in any 
interval around a there are numbers x with f(x) = 0, and also numbers x with 
f(x) = 1, so that we would need |0 —?| < y and also |I —/| < ie 

An amusing variation on this behavior is presented by the function shown in 
cea ue ge ho 

A eg rational 

0, «x irrational. 

The behavior of this function is “opposite” to that of g(x) = sin I/x; it ap- 
proaches 0 at 0, but does not approach any number at a, if a@ 4 0. By now you 
should have no difficulty convincing yourself that this is true. 


We conclude with a very simple example (Figure 14): 


=a. «+ 20 


Looe Ly ee 2 IO) 


If a > 0, then f approaches | near a: indeed, to ensure that | f() — 1] < é it 
certainly suflices to require that |x — a| <a, since this implies 


=a =i ae 
or ee 


so that f(x) = |. Simularly, if b < 0, then f approaches —1 near b: to ensure 
that | f(x) — (—1)| < € it suffices to require that |x — b| < —b. Finally, as you may 
easily check, f does not approach any number near 0. 


The time has now come to pomt out that of the many demonstrations about 
limits which we have given, not one has been a real proof. The fault hes not 
with our reasoning, but with our definition. If our provisional definition of a 
function was open to criticism, our provisional definition of approaching a limit 
is even more vulnerable. ‘This definition is simply not sufficiently precise to be 
used in proofs. It is hardly clear how one “makes” f(x) close to / (whatever 
“close” means) by “requiring” x to be sufficiently close to a (however close “sufh- 
ciently” close is supposed to be). Despite the criticisms of our definition you may 
feel (I certainly hope you do) that our arguments were nevertheless quite convinc- 
ing. In order to present any sort of argument at all, we have been practically forced 
to invent the real definition. It is possible to arrive at this definition in several steps, 
each one clarifying some obscure phrase which stll remains. Let us begin, once 
again, with the provisional definition: 


The function f approaches the limit / near a, if we can make f(x) as close 
as we like to / by requiring that x be sufficiently close to, but unequal to, a. 


‘The very first change which we made tn this defmition was to note that making 
f(x) close to / meant making |f (+) —/| small, and similarly for x and a: 
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DEFINITION 


The function f approaches the limit / near a, if we can make | f (+) —/| as 
small as we like by requiring that |« — a| be sufficiently small, and x 4 a. 


The second, more crucial, change was to note that making | f(*) —/| “as small as 
we hke” means making | f(x) —/| < e for any ¢ > 0 that happens to be given us: 


The function f approaches the limit / near a, if for every number ¢ > O we 
can make | f(x) —/| < ¢ by requiring that |x — a| be sufficiently small, and 


Xo Ge 


‘There is a common pattern to all the demonstrations about limits which we have 
given. For each number ¢ > 0 we found some other positive number, 6 say, with 
the property that if x #4 a and |x —a| < 6, then | f(x) —7| < e. For the function 
f@) = xswl/x With a = 0,2 = 0), the number 6 was justahé number =; 
Ke JAC — ve sin 1/x, it was 67; for f(x) = x? it was the minimum of | and 
€/(2|a| + 1). In general, it may not be at all clear how to find the number 4, 
given €, but it is the condition |x —a| < 6 which expresses how small “sufficiently” 
small must be: 


The function f approaches the limit / near a, if for every ¢ > 0 there is some 
6 > 0 such that, for all x, if |x —a| < 6 and x £a, then |f(x) —| <e. 


This is practically the definition we will adopt. We will make only one trivial 
change, noting that “* 
lx —a| <6.” 


x —a| < 6 and x 4 a” can just as well be expressed “O < 


The function f approaches the limit / near a means: for every ¢ > O there 


is some 6 > QO such that, for all x, if O < |x —a| <6, then | f(x) —/| <e. 


This definition is so important (everything we do from now on depends on it) that 
proceeding any further without knowing it is hopeless. If necessary memorize it, 
like a poem! That, at least, is better than stating it incorrectly; if you do this you 
are doomed to give incorrect proofs. A good exercise in giving correct proofs is to 
review every fact already demonstrated about functions approaching limits, giving 
formal proofs of each. In most cases, this will merely mvolve a bit of rewording 
to make the arguments conform to our formal definition — all the algebraic work 
has been done already. When proving that f does nod approach / at a, be sure to 
negate the definition correctly: 


If it is wo true that 


for every € > 0 there is some 6 > 0 such that, for all x, if O < |x —a| < 4, 


then | f(x) —1] <e, 


then 
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there is some € > O such that for every 5 > O there is some x which satisfies 
O < |x —a| <6 but not |f(x) -—1| <e. 


Thus, to show that the function f(x) = sin 1/x does not approach 0 near 0, we 
consider ¢ = 5 and note that for every 6 > 0 there is some x with 0 < |x —O| < 6 
but not | sin 1/x —0| < 4—namely, an x of the form 1/(7/2 + 27), where 7 is 
so large that 1/(@/2 + 2nn) <6. 

As a final illustration of the use of the definition of a function approaching a 
limit, we have reserved the function shown in Figure 15, a standard example, but 
one of the most comphcated: 


0, * irrational, 0 =x = | 
Vo, x«— pg im lowest terms, 0 <x = |. 


fx)= 


(Recall that p/q is in lowest terms if p and q are integers with no common factor 
and q > 0.) 


5 e 0, x irrational 
fa=y; 1 De 
—, x = — in lowest terms 
q q 
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For any number a, with 0 <a < 1, the function f approaches 0 at a. To prove 
this, consider any number ¢ > 0. Let 7 be a natural number so large that 1/n < e. 
Notice that the only numbers x for which | f(x) — O| < € could be false are: 


ee eI eae l n— | 


5 


4 ASS) Ss Pe tee be 

(If @ is rational, then a might be one of these numbers.) However many of these 
numbers there may be, there are, at any rate, only finitely many. Therefore, of all 
these numbers, one ts closest to a; that is, |p/q —a| 1s smallest for one p/q among 
these nuinbers. (If a happens to be one of these numbers, then consider only the 
values |p/q — a| for p/q #a.) This closest distance may be chosen as the 6. For 
if O < |x —a| < 6, then x is not one of 


l n—| 
oF Me n 


100 = foundations 


THEO 


| — p= 
length eng Pea 


“ “ 


PIGURE 16 


REM 1 


PROOF 


and therefore | f(x) — O| < e€ zw true. This completes the proof. Note that our 
description of the 6 which works for a given € is completely adequate— there is no 
reason why we must give a formula for 5 in terms of ¢€. 

Armed with our definition, we are now prepared to prove our first theorem; you 
have probably assumed the result all along, which is a very reasonable thing to do. 
This theorem is really a test case for our definition: if the theorem could not be 
proved, our definition would be useless. 


A function cannot approach two different limits near a. In other words, if f 
approaches / near a, and f approaches m near a, then / =m. 


Since this is our first theorem about lhmiuts it will certainly be necessary to translate 
the hypotheses according to the definition. 

Since f approaches / near a, we know that for any ¢ > 0 there is some number 
oy > O'suclvthatsior all x, 


if 0 < |x —a| < 8), then | f(x) —1| <e. 


We also know, since f approaches m near a, that there is some 62 > O such that, 
for all x, 


if 0 < |x —a@)<67,,then | f@)— m= e. 


We have had to use two numbers, 6; and 43, since there is no guarantee that the 4 
which works in one definition will work in the other. But, in fact, it is now easy to 
conclude that for any € > 0 there is some 6 > O such that, for all x, 


if 0 < |x — a| < 4, then | f(x) —/| < € and | f(x) —m| < e; 


we simply choose 6 = min(é1, 42). 
To complete the proof we just have to pick a particular e > 0 for which the two 
conditions 
(f(x) -I| <e and |f(x)-—m|<e 


cannot both hold, if / #4 m. The proper choice is suggested by Figure 16. If 
1 Am, so that | —m| > 0, we can choose {/ — m|/2 as our e. It follows that there 
is'a 0 > O'suchemat, for all x. 


|/ — m| 
oy) 


= 


: 1 = par 
afi | AGE —"ar| = 


~ 


ifO < |x —a| <4, then | f(x) -—I{ < 


‘Vhis implies that for 0 < [x — a| <6 we have 


[=| =H — fO)+ fi) —tel = ¥ — fea) — me 
l/—m| |l—m| 
<—— + ——— 


a contradiction. § 
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The number / which f approaches near a is denoted by lim f(x) (read: the limit 


Xa 


of f(x) as x approaches a). ‘This definition is possible only because of Theorem 1, 
which ensures that lim f(«) never has to stand for two different numbers. The 


poe A S| 
equation 


lim On) = 


xa 


has exactly the same meaning as the phrase 
f approaches / near a. 


The possibility still remains that f does not approach / near a, for any /, so that 
lim f(x) =/ is false for every number /. This is usually expressed by saying that 


tie tt 


‘dim, fi(x) dogs wot exist 


EX C4, 


Notice that our new notation introduces an extra, utterly irrelevant letter x, 
which could be replaced by t, y, or any other letter which does not already 
appear— the symbols 


lima f (), lim f(t), lim f(y). 
[>a 


Seg ya 


all denote precisely the same number, which depends on f and a, and has nothing 
to do with x, 7, or y (these letters, in fact, do not denote anything at all). A more 
logical symbol would be something like lim f, but this notation, despite its brevity, 


dad 
is so infuriatingly rigid that almost no one has seriously tried to use it. The notation 
lim f(x) is much more useful because a function f often has no simple name, even 


Paro f 5 
though it might be possible to express f(x) by a simple formula involving x. Thus, 
the short symbol 


a 2 . 
lim (x~ + sin.) 


X—>a 


could be paraphrased only by the awkward expression 
. a . 
hm f, Where, f(x) = x- — sit. 
a 
Another advantage of the standard symbolism is illustrated by the expressions 


lim x + Pr, 


Pal | 


antes 28 r. 


ta 
The first means the number which f approaches near a when 
: 3 y 

Fie )=1sh4-, for all: 

the second means the number which f approaches near a when 
2} 

FC) = fet or all. 

You should have little difficulty (especially if you consult Theorem 2) proving that 


lina se = mys 


Sect 


linn + go — oy, + a, 


tai 
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These examples illustrate the main advantage of our notation, which is its flex- 
ibility. In fact, the notation lim f(x) is so flexible that there is some danger of 


XS FeL 
forgetting what it really means. Here is a simple exercise in the use of this no- 
tation, which will be important later: first interpret precisely, and then prove the 
equality of the expressions 
lan f(x) and lim f(a+h). 
h>0 


Xa 


An important part of this chapter is the proof of a theorem which will make 
it easy to find many limits, as we promised long ago. ‘The proof depends upon 
certain properties of inequalities and absolute values, hardly surprising when one 
considers the definition of limit. Although these facts have already been stated in 
Problems 1-20, 1-21, and 1-22, because of their importance they will be presented 
once again, in the form ofa lemma (a lemma is an auxiliary theorem, a result that 
justifies its existence only by virtue of its prominent role in the proof of another 
theorem). ‘The lemma says, roughly, that if x is close to x9, and y is close to yo, 
then x + y will be close to x9 + yo, and xy will be close to x9y9, and 1/y will be 
close to 1/yg. ‘This intuitive statement is much easier to remember than the precise 
estimates of the lemma, and it is not unreasonable to read the proof of Theorem 2 
first, in order to see just how these estimates are used. 


Aen te 
€ € 
[inal oi) = 5 and |y — yol < 5" 
then . 
| Conny) iCrorts Yo) | =: 
(2) If 
E E 
x — xo] < min { 1, ————— ]} and _ ly — yo| < ———. 
Hanae ( 2(|vol + 7 oo 2xol + D 
then 


Ixy — xXoyo| < &. 


(3) If yo 4 0 and 


i 
Pein Iyol €lyol” 
py) YO may ; ) 
then y 40 and 
I | 
———|<e. 
or) 
(I) (x y) — (x0 + yo)| = |G = xo + O = yo)l 


lA 
ye) 


. 
[i — eo | tae OPO aca 


- 


(2) Since |x — xo| < | we have 


lx |= Evo] = |Repeerolieccnle 
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so that 
|x| < | + lxo|. 


‘Thus 


Ixy — xoyol = lx(y — yo) + yo(x — xo)| 
< |x| - ly — yol + lyol - lx — xol 


€ € 
Se SS a 9) FS 
2(|xo| + 1) 2(lyo| + 1) 
cone 8 
(3) We have 
lyo| 
Soe = yo) = 
2 
so |y| > |yo|/2. In particular, y 4 0, and 
| 2 
lvl lyol 
Thus , 
hed lyo — yl 2 L eor 
a | NEN ee =e. 
y yo lyl-lyol  lyol lyol 2 


THEOREM 2 If lim f(x) =/ and lim g(x) =m, then 
x 


+—a 


xXx=—>a 


(1) lim(f + g)@) =1 +m; 
(2) lim(f-g)(v) =1-m. 


Moreover, if m 4 0, then 


l l 
(3) lim (- (x) =—. 
La \ g m 
PROOF ‘The hypothesis means that for every ¢€ > 0 there are 5;,452 > 0 such that, for 
all x, 
if 0 < |x —a| < 4), then | f(x) -—1| < «, 
and if O < |x —a| < 59, then |g(x) — m| < «. 
This means (since, after all, ¢/2 is also a positive number) that there are 5), 52 > 0 
such that, for all x, 


if 0 < |x —a| < 6), then |f(x)—1| < = 


and if 0 < |x —a| < 4, then 


; € 
g(x) —m| < =e 


Now let 5 = min(6;, 42). If 0 < |x —a| < 6, then 0 < |x —a| < 5; and 
() < |x —a| < 49 are both true, so both 


& & 
|f(x)-1| < ji and |g(x) —m| < = 


— — 
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are true. But by part (1) of the lemma this implies that |(f + g)(x) — (1+ m)| < e. 
This proves (1). 

‘lo prove (2) we proceed similarly, after consulting part (2) of the lemma. If 
¢ > 0 there are 6), 52 > O such that, for all x, 


if 0 < |x —a| < 4), then | f(x) —/| < min (1. aan 


and if 0 < |x —a| < 49, then |g(x) —m| < 


£ 
2(\f| + 1) 
Again let 6 = min(d,, 52). If 0 < |x —a| <6, then 


& 


lf(x) -1| < mi eae 


£ 
and Do —_— 
) and = |g(x) — m| < AUN +) 


So, by the lemma, |(f - g)(x) —1-m| < ¢, and this proves (2). 
Finally, if ¢ > 0 there is a 6 > O such that, for all x, 


|m| e|m|? 


Pas 


if O < |x —a| < 6, then |g(x) — m| < min 


But according to part (3) of the lemma this means, first, that g(x) 4 0, so (1/g)(x) 
makes sense, and second that 


Sy of 


This proves (3). J 


Using ‘Theorem 2 we can prove, trivially, such facts as 


' PIs wt Ta 

im ————_— = ———_.. 

rea x24] a> +] 

without going through the laborious process of finding a 4, given an ¢. We must 
begin with 


lim 7 = 7, 
xa 
lim 1 = I, 
xX—a 
lim x =a. 
D Se al 4 | 


but these are easy to prove directly. If we want to find the 5, however, the proof of 
Theorem 2 amounts to a prescription for doing this. Suppose, to take a simpler 
example, that we want to find a 6 such that, for all x, 


if O < |x —a| < 6, then |x? Aix —(iq" +a)| <e. 
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Consulting the proof of Theorem 2(1), we see that we must first find 6; and 52 > 0 
such thatsfor all, 


oe 2 
if O < |x —a| < 6, then |x? —a|< 


w| 


€ 
and if O < |x —al| < 49, then |x —a| < = 


5" 
se : : : 2 2 ¢ ; 
Since we have already given proofs that im x7 = a~ and lim x = a, we know how 
We ae! ote 
to do this: 
€ 
2 


5) = 


‘Thus we can take 


6 = min(6), 62) = min {mm { I, 


if 0 < |x —a| <6, then 


oy bela.) 
The proof of Theorem 2(3) shows that the second condition will follow if we find 


a 6 > 0 such that, for all x, 


2 4 
if : alr ele 
if 0 < |x —a| <6, then |x*—a*| < min : “ 


— _ 


. 


Thus we can take 


a ela 
eupiar ey 5) 


Cine Le 
2\a| + 1 
Naturally, these complicated expressions for 6 can be simplified considerably, after 
they have been derived. 
One technical detail in the proof of Theorem 2 deserves some discussion. In 
order for lim f(x) to be defined it is, as we know, not necessary for f to be defined 


Ayal 
at a, nor is it necessary for f to be defined at all pomts x 4 a. However, there 
must be some 6 > 0 such that f(x) ts defined for x satisfying 0 < |x —al| < 6; 
otherwise the clause 


“af 0 < |x — al < 6, then |'f@) —7\<e” 


would make no sense at all, since the symbol f(*) would make no sense for 
some x’s. If f and g are two functions for which the definition makes sense, 
it is easy to see that the same is truce for f + g and f -g. But this is not so 
clear for 1/g, since 1/g 1s undefined for x with g(x) = 0. However, this fact gets 
established in the proof of Theorem 2(3). 
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FIGURE 18 


There are umes when we would like to speak of the limit which f approaches 
at a, even though there is no 6 > O such that f(x) is defined for x satisfying 
O < |x —a| < 6. For example, we want to distinguish the behavior of the two 
functions shown in Figure 17, even though they are not defined for numbers less 
than a. For the function of Figure 17(a) we write 


hop fiw or lint f(a 


x—>atr xa 


(The symbols on the left are read: the limit of f(¥) as x approaches a from above.) 
These “limits from above” are obviously closely related to ordinary limits, and the 
definition is very similar: lim f(x) =/ means that for every ¢ > 0 there isa 6 > 0 


x—at 


such @hiat tor all x, 
if O <x —a <4, then |f(x) -l1| <e. 


(The condition “O < x —a < 8” 1s equivalent to “O < |x — a| <6 and x > a.”) 


“Limits from below” (Figure 18) are defined similarly: lim f(x) = 1 (or 
xa 
hn fi@) = Pimeans that for every ee> O theres avd > 0 such that, for 
xta i 
all x, 


if 0 <a—x <4, then |f(x) -—/| <e. 


It is quite possible to consider limits from above and below even if f is defined 
for numbers both greater and less than a. ‘Thus, for the function f of Figure 14, 
we have 


In Ce) = and linn of i a 


x—Ot 


It is an easy exercise (Problem 29) to show that lm f(x) exists if and only if 


Demat et 


hm f(x) and lm f(x) both exist and are equal. 


yy sxe 

Like the definitions of limits from above and below, which have been smugeled 
into the text informally, there are other modifications of the limit coneept which 
will be found useful. In Chapter 4 it was claimed that if x is large, then sin I/x is 
close to 0. ‘This assertion ts usually written 


iunalisity el fte= Q. 


Vo 


The symbol lim f(x) is read “the limit of f(x) as « approaches 00,” or “as + 
v— OO 
becomes infinite.” and a limit ofthe form hm f (+) is often called a limit at infinity. 
XB 
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Figure 19 illustrates a general situation where lim f(x) =/. Formally, lim f(x) = 
x00 x00 


/ means that for every ¢ > 0 there is a number N such that, for all x, 
if x > N, then | f(x) —1| <e. 


The analogy with the definition of ordinary limits should be clear: whereas the 
condition “Q < |x — a| < 5” expresses the fact that x is close to a, the condition 
“x > N” expresses the fact that x is large. 


FIGURE 19 


We have spent so little time on limits from above and below, and at infinity, 
because the general philosophy behind the definitions should be clear if you un- 
derstand the definition of ordinary limits (which are by far the most important). 
Many exercises on these definitions are provided in the Problems, which also con- 
tain several other types of limits which are occasionally useful. 


PROBLEMS 


1. Find the following limits. (These limits all follow, after some algebraic ma- 
nipulations, from the various parts of Theorem 2; be sure you know which 
ones are used in each case, but don’t bother listing them.) 

. x= 
(1) lim. ———. 

ral x+1 
“= x? 28 
(u) hm ——. 

x32 x —2 
3 

x —8 
r—-2° 


(i) lim 
r33 
, ; xy" y" 
iv) hm ——— 
ry x—y 
| yes y” 
(v) lim ; 
yox x —y 
A _ Vvat+th—Ja 
(a) hn —_—_—__——... 
h—-+-0 h 


2. Find the following limits. 


4) lim 
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Gi) lim 
x0 Xx 
awn a Lo-vl—-x? 
(iu) lim ————-. 
x0 x 


In each of the following cases, determine the limit / for the given a, and 
prove that it is the limit by showing how to find a 6 such that | f(+) —1| < e 
for all x satisfying 0 < |x —a| <6. 
i Gh) =x [3 ~ cos(x7)], a=:0: 
(ii) f(x) =x*2+5x—2, a=2. 
- 100 
(i) f(x)= —, a=1. 
x 
(iv) f(x) =x*, arbitrary a. 


(Vv) fix)=xt+ e a=. 
x 


(va) f@) = a =0. 


See ] 
2 —sin” x 
Cat) Ch) = w/e =O, 
(vin) f(x) = J/x, a=. 
For each of the functions in Problem 4-17, decide for which numbers a the 
limit lim f(x) exists. 

+a 
(a) Do the same for each of the functions in Problem 4-19. 
(b) Same problem, if we use infinite decimals ending in a string of 0's 

instead of those ending in a string of 9's. 
Suppose the functions f and g have the following property: for all ¢ > O 


and all x, 


9 


‘ ; & ‘ 
if 0 < |x —2| < sin? (5) +e, then | f(x) — 2| < «, 
if O < |x —2| <7, then |g(x) —4| <e. 

For each e > 0 find a 5 > O such that, for all x, 


(i) aif O < |x —2| <6, then |f(x)+ g(x) —6| < «. 
(ii) if O < |x —2| <6, then |f(x)g(x) — 8| <e. 


| | 
(iti) if O < |x —2| <6, then |—— — i < 
g(x) 4 
x l 
(iv) aif O < |x —2| <6, then £) ——| <<, 
eg) 2 


Give an example of a function f for which the following assertion 1s false: 
If | f(x) —1| < € when O < |x —al| < 64, then |f(x) —/| < &/2 when 
Q < |x —a| < 8/2. 
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(a) If lim f(x) and lim g(x) do not exist, can lnm | f(x) + g(x)] exist? Can 
lim ey g(x) Bane | 

(b) If lim f(x) exists and lim [ f(x) + g(x)] exists, must tin 2(x) exist? 

(c) If lim f (x) exists and lim g (x) does not exist, can lim [ f (x)+2(x)] exist? 

(d) If tim f(x) exists al hin f(x)g(x) exists, does it follow that lim g(x) 
ates as . 

Prove that tim jhes = Hse f(a +h). (This is mainly an exercise in under- 

standing what the terms mean.) 


(a) Proveithat lim f(x) —/ if and only if lim[f(@) = #4) =0. @irstsee why 
x—a xa 
the assertion is obvious; then provide a rigorous proof. In this chapter 


most problems which ask for proofs should be treated in the same way.) 
(b) Prove that lim f(x) = lm f(@ —- a). 
x-> D, Gor af 4 


(ce). Provethat@lmiY ) = a if (x3), 
x0 x—> 
(d) Give an example where him f (x7) exists, but hm f (x) does not. 
jee > 


Suppose there is a 6 > O such that f(x) = g(x) when 0 < |x —a| < 6. Prove 
that lm f(x) = lim g(x). In other words, lim f(x) depends only on the 


xa 
values of f(x) for x near a—this fact is often expressed by saying that limits 
are a “local property.” (It will clearly help to use 6’, or some other letter, 
mstead of 6, in the definition of limits.) 
(a) Suppose that f(x) < g(x) for all x. Prove that im f(x) < lm g(x), 
Xa ASG 
provided that these limits exist. 
(b) How can the hypotheses be weakened? 
(c) If f(x) < g(x) for all x, does it necessarily follow that lm f(x) < 


. Nas Cl, 
lim g(x)? 


Suppose that f(x) < g(x) < A(x) and that Im f(x) = lim A(x). Prove that 


lim g(x) exists, and that lim g(x) = lim f(x) = lm A(x). (Draw a picture!) 
Pe ae | Boat | A 


x—a 

(a) Prove that if hm ti —seand b ~ 0, then hm ECB aa)/ obi Elin: 
Write f(bx)/x = bl f(bx)/bx]. . 

(b) What happens if b = 0? 

(c) Part (a) enables us to find lim (sin 2x)/x in terms of lim (sin.x)/x. Find 


x0 x—( 


this limit in another way. 


Evaluate the followmg limits in terms of the number a = lim (sin.x)/¥. 
- x0 


. t Sgin 3% 
G) lm 

x>0 X 

P . sinax 
Gr) = lim 


x0 sin bx 
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Li. 


18. 


19. 


*20; 


24) 8 


aD 
ea 4. cmt 25 
(1) == lim ——— 
x0 BX 
Go » 
Dee Silt: 2x 
(iv) lm ——. 
x30 X- 
. 1—cosx 
(v) = lim — ~— 
x30 a 


_, Mg tane x +2 
a 
x0 x +x° 
Tepe aSIN % 
(vn) lm ———. 
x0 | = cos x 
aa a. siIN(X + /) — sin.x 
(vill) linn ———______, 
h-O h 
; center — 1) 
(x) lm ————— 
x] x—] 
4) 5 
x°(3 + sin x) 


(x) lm —————.. 
x-0 (x + sin x)- 


1 3 
(x1) lim (x2 — 1)> sin ( ) 
rl x—| 


(a) Prove that if lim f(x) =/, then lim |[f|(x) = [/]. 


Xa 


(b) Prove that if lim f(x) = / and lim g(x) = m, then lim max(f, g)(x) = 


+a 


max(/,m) and similarly for min. 


(a) Prove that lim 1|/x does not exist, 1.e., show that lim I/x = / is false for 
x>0 


3S? 
every number J. 
(b) Prove that lm 1/(x — 1) does not exist. 
x] 


Prove that if lim f(x) = /, then there is a number 6 > 0 and a number M 


A>. 


such that | f(x)| < M if 0 < |x —a| <6. (What does this mean pictorially?) 
Hint: Why does it suffice to prove that /—1 < f(x) < /+1 for 0 < |y—a| < 5? 


Prove that af Sf G)e=' 0 tor uifrational » and) f(v)) = | for rational ™, 
then lim f(x) does not exist for any a. 

Be a(t! 
Prove that if f(x) =x for rational x, and f(x) = —~x for irrational x, then 
lim f(x) does not exist if a 4 0. 
Dia 10 | 


(a) Prove that if him g(x) = 0 thew him g(x) Si / a= 0. 
> > 
(b) Generalize this fact as follows: If lim g(x) = 0 and |A(x)| < M for all x, 
i x0 
then him gQyn (x) = 0. (Naturally it is unnecessary to do part (a) if you 
x—>( 


succeed m doing part (b); actually the statement of part (b) may make it 
easier than (a)— that’s one of the values of generalization.) 


22 


Si 0 


MIA. 


20. 
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Consider a function f with the following property: if g is any function for 
which lm g(x) does not exist, then lim[ f(x) + g(x)] also does not exist. 
x0 


x30 


Prove that this happens if and only if lm f(x) does exist. Hint: This is 
x0 

actually very easy: the assumption that hin f(x) does not exist leads to an 
> 


immediate contradiction if you consider the right g. 


This problem is the analogue of Problem 22 when f + g is replaced by f -g. 
In this case the situation is considerably more complex, and the analysis 
requires several steps (those in search of an especially challenging problem 
can attempt an independent solution). 


(a) Suppose that hm f() exists and is #0. Prove that if lm g(x) does not 
1G? x2 
exist, then hm f(x)g(x) also does not exist. 
x0 
(b) Prove the same result if him | f(x)| = oo. (The precise definition of this 
82 


sort of limit is given in Problem 37.) 
(c) Prove that if neither of these two conditions holds, then there is a function 
g such that lim g(x) does not exist, but him f(x)g(x) does exist. 
oe 20 


Hint: Consider separately the following two cases: (1) for some ¢ > 0 
we have |f(x)| > € for all sufficiently small x. (2) For every ¢ > 0, there 
are arbitrarily small x with |f(x)| < ¢. In the second case, begin by 
choosing points x, with |x,| < 1/n and | f(Q%,)| < 1/n. 


Suppose that A, is, for each natural number n, some /fuile set of numbers in 
[O. I], and that A,, and A,, have no members in common if m 4 n. Define 
f as follows: 


foo™ A xn AL, 


x notin A, for any n. 


Proventihat lui 7 (x) = Ofer all ain |0, li: 


Se 


Explain why the following definitions of lim f(x) =/ are all correct: 


Oe 8 | 


For every 6 > O there is an ¢ > O such that, for all x, 


G) af O < |x —a| <e, then |f(x)—/1| <6. 

(i) if O < |x —a| <e, then |f(x) —/1| <6. 

(m) if 0 < |x —al <e, then | f(x) —7| < 56. 

(iv) if O < lw —al| < €/10, then | f(x) —1| <6. 

Give examples to show that the following definitions of lim f(x) =/ are not 

correct. ee 

(a) For all 6 > 0 there is an € > O such'that if O < [x —a| < 6, then 
l[f(x)—1| <: 

(b) Forrallse > O thereus a 6 > 0 Such that if |f(@) —/1| <e;then 0 < 
|x —a| <6. 


112 Foundations 


Di. 


*28. 


29. 


30. 
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323 


oe 
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For each of the functions in Problem 4-17 indicate for which numbers a the 
one-sided limits lim f(x) and lim f(x) exist. 
i x>a7 


AU 36! 
(a) Do the same for each of the functions in Problem 4-19. 
(b) Also consider what happens if decimals ending in 0’s are used instead of 
decimals ending in 9’s, 


Prove that lm f(x) exists if tim, f(x) = lm f(x). 


xa ieeicl 

Prove that 
1 lmy f(x) = dim f(—x). 
0) 20% f x—0 f 
Gt peli f (|x) “ite f C2). 

x0 x 0+ 
see . 9} . 
fi seat cos), = slay (25) 

x—0 x= OF 
(These equations, and others like them, are open to several interpretations. 
They might mean only that the two limits are equal if they both exist; or that 
ifa certain one of the limits exists, the other also exists and is equal to it; or 


that if either Imut exists, then the other exists and is equal to it. Decide for 
yourself which interpretations are suitable.) 


Suppose that lim f(x) < lim f(x). (Draw a picture to illustrate this as- 
La a 


i> Cl 


sertion.) Prove that there is some 6 > 0 such that f(x) < f(y) whenever 
x <a<yand |x —a| <6 and |y —a| <6. Is the converse true? 


Prove that lim (a,x" + ---+a9)/(By»x" +---+ bo) (with a, 4 0 and by, 4 0) 


X00 
exists if and only if m > n. What is the limit when m =n? When m > 1? 


Hint: the one easy limit is lim 1/x* = 0; do some algebra so that this is the 
X¥>0O 


only information you need. 


Find the following limits. 


() lan x +sin” x 
r>0 «5x +6 
x six, 
ll lim 
(Gm) lim v xe+x—X. 
am \* jee *) eas) 


x>00 (x +sinx)? 


Prove that thine, (i /x))= lina of Gr): 
x(t X00 
Find the followmg limits in terms of the number a = him (sin x) eS 
a x( 


smn.x 


(1) lim 
. X00 YX 
naaeae cal 
(a) lum asift = 
X00 Xx 


ee 


36. 


37. 


38. 


40. 
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Define “iim f(x) =I” 
x<—>—6o 


(a) Find lim (a;,x" +--+++ d9)/(by x” + +--+ bo). 


X—>— 00 
(b) Prove that lim f(x)= lim /f(—x). 
xX-> OO X—>— OO 


(c) Prove that lim f(1/x)= lim f(x). 
x—0>- X—+>—0O 


We define lim f(x) = o¢ to mean that for all N there is a 6 > 0 such that, 


x—a 
for all x, if O < |x —a| <6, then f(x) > N. (Draw an appropriate picture!) 
(Of course, we may still say that lim f(x) “does not exist” in the usual sense.) 


x+—a 


(a) Show that lim 1/(x — 3)? = 00. 


t= 


(b) Prove that if f(x) > ¢ > 0 for all x, and lim g(x) = 0, then 


xX—a 


lim f(x)/|g(x)| = 00. 


(a) Define lim f(x) = oe and lim f(x) = oe. (Or at least convince your- 
x—at xa" 
self that you could write down the definitions if you had the energy. How 
many other such symbols can you define?) 


(b) Prove that lm 1/x = oo. 


xr—0* 


(c) Prove that lim f(x) = 0 if and only if lim f(1/x) = oe. 
x—> OO 


r—0* 


Find the following limits, when they exist. 


(i) = lim x(1 + sin? x). 
xO 


ae : Pai) 
(111) lim x sin* x. 


¥—+00 

re , ene 

(iv) hm x*sin -. 
x00 ic 

(v) lim ¥x-+2x —x. 
X—> OO 

(vi) lim x(/x+2—/%). 
X—> OO 

i. ee |x| 

(vil) lim ; 

x00 «CX 


(a) Find the perimeter of a regular n-gon inscribed in a circle of radius r. 
[Answer: 2rn sin(z/n).| 
(b) What value does this perimeter approach as n becomes very large? 


(c) What limit can you guess from this? 
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41. 


#42; 


(a) For c > 1, show that c!/" = V/e approaches | as n becomes very large. 
Hint: Show that for any ¢ > 0 we cannot have c!/" > 1 + ¢ for large n. 
(b) More generally, if ¢ > 0, then c!/" approaches | as n becomes very large. 


After sending the manuscript for the first edition of this book off to the printer, 


A i 5 7 
I thought of a much simpler way to prove that lim x? = a? and lim x* = 
xa xa 
3 


a”, without going through all the factormg tricks on page 92. Suppose, for 
. 2 2 : : 
example, that we want to prove that lim x“ = a“, where a > 0. Given 


ta 


€ > 0, we simply let 5 be the minimum of Ya? +¢ —a and a — Va? —€ 
- Figure. 19); then lx — a| <§ | implies that /a2—e < x < Va2+e, so 


—¢ <x? <a’?+e, or |x? —a?| < €. It is fortunate that these pages had 
a been set, so that I couldn't make these changes, because this “proof” 
is completely fallacious. Wherein lies the fallacy? 


FIGURE 1 


FIGURE 3 


CHAPTER 


DEFINITION 


CONTINUOUS FUNCTIONS 


If f is an arbitrary function, it is not necessarily true that 


linn! fe) f(a). 

x—a 
In fact, there are many ways this can fail to be true. For example, f might not 
even be defined at a, in which case the equation makes no sense (Figure 1). 


Again, lim f(x*) might not exist (Figure 2). Finally, as illustrated in Figure 3, 


38 act Ef 


even if f is defined at a and lim f(x) exists, the limit might not equal f(a). 


ead] 


@) ) © 
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We would like to regard all behavior of this type as abnormal and honor, with 
some complimentary designation, functions which do not exhibit such peculiarities. 
The term which has been adopted is “continuous.” Intuitively, a function f 1s 
continuous if the graph contains no breaks, jumps, or wild oscillations. Although 
this description will usually enable you to decide whether a function is continuous 
simply by looking at its graph (a skill well worth cultivating) it 1s easy to be fooled, 
and the precise definition is very important. 


The function f is continuous at a if 


him f(x) = f(a). 


We will have no difficulty finding many examples of functions which are, or are 
not, continuous at some number a—every example involving limits provides an 
example about continuity, and Chapter 5 certainly provides enough of these. 

The function f(«) = sin I/x is not continuous at 0, because it is not even defined 
at QO, and the same is true of the function g(v) = x sin 1/x. On the other hand, if 
we are willing to extend the second of these functions, that is, if we wish to define 


le 
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a new function G by 


a sinely/oa, wee 0 


C= | d, al). 


then the choice of a = G(O) can be made in such a way that G will be continuous 
at O— to do this we can (if fact, we must) define G(O) = 0 (Figure 4). This sort of 
extension is not possible for f; if we define 


eve | sim 1i/x. eel 


2 Oh x10) 
then F will not be continuous at 0, no matter what a is, because hm f(x) does 
ipo 
not exist. 
The function 
in x rational 
foe baton, 
Q), Xx irrational 
(a) is not continuous at a, if a 4 0, since lim f(x) does not exist. However, lim f(x) = 
F xa x0 
0 = f(0), so f is continuous at precisely one point, 0. 
an : 4 : 
Hhetunctions {(x) = cs POap=ertandeiG) =e are continuons at al nun 
bers a, since 
G . . 
lint f Ge: lin ¢c =e aia), 
2 (8 | Xa 
lim oy "= "hnte =fa—= ea), 
Gar o| iat eh 
ee ae 
limerérjt lin x” =e — hia 
Gm (2! Xa 
Finally, consider the function 
(b) hcg 0, X irrational 
= 
Sar teat L/@. x = p/q m lowest terms. 


In Chapter 5 we showed that hm f(«) = 0 for all a (actually, only for 0 < a < 1, 


PG fj 


but you can easily see that this is true for all a). Since 0 = f(a) only when a is 
irrational, this function is continuous at a if a ts irrational, but not if a is rational. 


It is even easier to give examples of continuity if we prove two simple theorems. 


THEOREM 1 If f and g are continuous at a, then 


(1) f+ is continuous at a, 
(2) f +g is contmuous at a. 


Moreover, if g(a) 4 0, then 


(3) 1/g is contnuous at a. 


PROOF 


THEOREM 2 


PROOF 
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Since f and g are continuous at a, 

lim f(x) = f(a) and lim g(x) = gta). 
By Theorem 2(1) of Chapter 5 this mmphies that 

lim Cf + g)(x) = f(a) + g(@) =(f + 8)@), 


which is just the assertion that f + g is continuous at a. The proofs of parts (2) 
and (3) are left to you. § 


Starting with the functions f(x) = c¢ and f(x) = x, which are continuous at a, 
for every a, we can use Theorem | to conclude that a function 
Da ps ee wa... + ho 
Code” se Ca es? eae 1) 


faj= 


is continuous at every point in its domain. But it is harder to get much further 

than that. When we discuss the sine function in detail it will be easy to prove that 

sin 1s continuous at a for all a; let us assume this fact meanwhile. A function like 
a Wh a) A. 

Sins Meche E ae Sinks 


7 ou 
x + 4x4 sin* x 


ee 


aD 
sim 


can now be proved continuous at every point in its domain. But we are still 

unable to prove the continuity of a function like f(x) = sin(x2); we obviously 

need a theorem about the composition of continuous functions. Before stating this 

theorem, the following point about the definition of continuity is worth noting, If 

we translate the equation hin f(x) = f(a) according to the definition of limits, 
Poe 


we obtain 


for every € > O there is 6 > O such that, for all x, 

if 0 < |x —a| <6, then | f(x) — f(a)| < «. 

But in this case, where the hmit is f(a), the phrase 
0 < |x -—a| <6 
may be changed to the simpler condition 
|x —a| <6, 

since if x =a it is certainly true that | f(x) — f(@| < e. 
If g is continuous at a, and f is continuous at g(a), then f og is continuous at a. 
(Notice that f is required to be continuous at g(a), not at a.) 
Let ¢ > 0. We wish to find a 6 > 0 such that for all x, 


if |x —a| < 6, then |[(f og)(x) — (f o g)(a)| < «€, 
Le, |f(g(x)) — f(g(a))| < «. 
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We first use continuity of f to estimate how close g(x) must be to g(a) in order 
for this inequality to hold. Since f is continuous at g(a), there is a 6’ > 0 such 
that for all y, 


(1) if |y — g(a)| < 6’, then | f(y) — f(g(a))| < e. 
In particular, this means that 
(25 if |g(x) — g(a)| < 6, then | f(g(x)) — f(g(a))| < «. 


We now use continuity of g to estimate how close x must be to a in order for the 
inequality |g(x) — g(a)| < 4’ to hold. The number 6’ is a positive number just like 
any other positive number; we can therefore take 4’ as the é (!) in the definition of 
continuity of g at a. We conclude that there is a 6 > O such that, for all x, 


(3) if |x —a| < 6, then |g(x) — g(a)| < 8. 
Combining (2) and (3) we see that for all x, 


if |x —a| <6, then | f(g(x)) — f(g(a))| < «. ff 


We can now reconsider the function 
xsin lem » 4 0 


Go = 0. ee 


We have already noted that f is continuous at 0. A few applications of Theorems | 
and 2, together with the continuity of sin, show that f 1s also continuous at a, for 
a £0. Functions like f(x) = sin(x? + sin(x + sin?(x3))) should be equally easy 
for you to analyze. 

The few theorems of this chapter have all been related to continuity of functions 
at a single point, but the concept of continuity doesn’t begin to be really interesting 
until we focus our attention on functions which are continuous at all pomts of some 
interval. If f is continuous at x for all x in (a,b), then f is called continuous 
on (a,b); as a “special case”, f is continuous on R = (—o0, ov) [see page 57] if 
it is continuous at x for all x in R. Continuity on a closed interval must be defined 
a little differently; a function f is called continuous on [a, b] if 


(1) f is continuous at x for all x in (a, b), 


(2) hn JOV=2/(@) and him J(e) =F (Db). 


(We also often simply say that a function is continuous if it is continuous at x for 
all x in its domain.) 

Functions which are continuous on an interval are usually regarded as especially 
well behaved; indeed continuity might be specified as the first condition which a 
“reasonable” function ought to satisfy. A continuous function is sometines de- 
scribed, intuitively, as one whose graph can be drawn without hftung your pencil 
from the paper. Consideration of the function 


sin k/x, w AO 


le i <*0 


flaj+e=5 


fla)-e= 


TGR: oS) 


THEOREM 3 


PROOF 


6. Continuous Functions 119 


shows that this description 1s a little too optimistic, but it is nevertheless tue that 
there are many important results involving functions which are continuous on an 
interval. There theorems are generally much harder than the ones in this chapter, 
but there is a simple theorem which forms a bridge between the two kinds of results. 
The hypothesis of this theorem requires continuity at only a single pomt, but the 
conclusion describes the behavior of the function on some interval containing the 
pomt. Although this theorem is really a lemma for later arguments, it 1s included 
here as a preview of things to come. 


Suppose f is continuous at a, and f(a) > 0. Then f(x) > 0 for all x in some 
interval containing a; more precisely, there is a number 6 > O such that f(x) > 0 
for all x satisfying |x —a| < 6. Similarly, if f(a) < 0, then there is a number 6 > 0 
such that f(x) < 0 for all x satisfying |x —a| < 4. 


Consider the case f(a) > 0. Since f is continuous at a, for every ¢ > 0 there is a 
Oo > /Q.sueh that, for alle; 


if |x —a| <6, then | f(x) — f(a@| < «. 
Le., —e < f(x) — fla) <e. 


In particular, this must hold for ¢ = i f(a), since 5 f(a) > 0 (Pigure 5)> Tlits 
there is 6 > O so that for all x, 


if |v —a| < 6, then —5 f(a) < f(*)— f(a) < 5 f(a), 


and this implies that f(x) > 4 f(a) > 0. (We could even have picked ¢ to be f(a) 
itself, leading to a proof that is more elegant, but more confusing to picture.) 

A similar proof can be given in the case f(a) < 0; take ¢ = —5. (a). Or one 
can apply the first case to the function —/f. 


PROBLEMS 


1. For which of the following functions f is there a continuous function F with 
domain R such that F(x) = f(x) for all x in the domain of f? 


; ae eal 


(it) «jf (x) = 0, x urrational, 
(iv) f(x) = 1/qg, x = p/q rational m lowest terms. 


2. At which pomts are the functions of Problems 4-17 and 4-19 continuous? 
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(a) Suppose that f is a function satisfying | f(x)| < |x| for all x. Show that 
f is continuous at 0. (Notice that f(0) must equal 0.) 

(b) Give an example of such a function f which is not continuous at any 
age 

(c) Suppose that g is continuous at 0 and g(O) = 0, and |f(x)| < |g(x)|. 
Prove that f is continuous at 0. 


Give an example of a function f such that f is continuous nowhere, but | f| 
is continuous everywhere. 


For each number a, find a function which is continuous at a, but not at any 
other points. 


(a) Finda function f which ts discontinuous at 1, 5. 7 q---+ but continuous 
at all other points. 

(b) Find a function f which is discontinuous at 1, 7 i a, aud at) but 
continuous at all other points. 


Suppose that f satisfies f(x + y) = f(x) + f(y), and that f is continuous 
at Q. Prove that f is continuous at a for all a. 


Suppose that f is continuous at a and f(a) = 0. Prove that if wa 4 0, then 
f +a is nonzero in some open interval containing a. 


(a) Suppose f is defined at a but is not continuous at a. Prove that for 
some number ¢ > 0 there are numbers x arbitrarily close to a with 
[f(x) — f(a)| > &. Illustrate graphically. 

(b) Conclude that for some number ¢ > O either there are numbers x arbi- 
trarily close to a with f(x) < f(a) —e or there are numbers x arbitrarily 
close to a with f(x) > f(a) +e. 


(a) Prove that if f is continuous at a, then so is | f|. 

(b) Prove that every function f continuous on R can be written f = E+ 0, 
where E is even and continuous and O 1s odd and continuous. 

(c) Prove that if f and g are continuous, then so are max(f,g) and 
min(f, g). 

(d) Prove that every continuous f can be written f = g—h, where g and h 
are nonnegative and continuous. 


Prove Theorem 1(3) by using Theorem 2 and continuity of the function 
(x) = dee 
(a) Prove that if f is continuous at / and hm g(x) =/, then hin f(g(v)) = 
Ya Va 
f(l). (You can go right back to the definitions, but it is easier to consider 
the function G with G(x) = g(x) for x 4a, and G(a) =1.) 


(b) Show that if continuity of f at / is not assumed, then it is not generally 
trut that litt f(e(x)) = fim 2@)) oAimteity f(r) =0 for x - ¢, and 
F od ¢ | va 


4) = I 


13% 


14. 


15. 


16. 


17. 
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(a) Prove that if f is continuous on [a,b], then there is a function g which 
is continuous on R, and which satisfies g(x) = f(x) for all x in [a,b]. 
Hint: Since you obviously have a great deal of choice, try making g 
constant on (—oo, a| and [b, oo). 

(b) Give an example to show that this assertion is false if [a,b] is replaced 


by (a, D). 


(a) Suppose that g and hf are continuous at a, and that g(a) = h(a). Define 
f(x) to be (x) if x = a and h(x)af « <a. Prove*that f is*contmuous 
at a, 

(b) Suppose g is continuous on [a,b] and fh is continuous on [b,c] and 
g(b) = hitb) milLeney (ebeaa(s ator memmslawb|eand! /i(x) foxx in [bic). 
Show that f is continuous on [a,c]. (Thus, contmuous functions can be 
“pasted together”.) 


Prove that if f is continuous at a, then for any ¢ > 0 there is a 6 > 0 so that 


SCO) — fee. 


(a) Prove the following version of ‘Theorem 3 for “right-hand continuity”: 
Suppose that lim f(x) = f(a), and f(a) > 0. Then there is a number 
x—at 


whenever |x — a| < 6 and |y — a| < 6, we have 


6 > Osuch that f(x) > O for all x satisfying O < x —a < 8. Similarly, 
if f(a) < 0, then there is a number 6 > O such that f(x) < O for all x 
SaLiiay gig 0) SM oe 


(b) Prove a version of Theorem 3 when him iw = Fi). 


If lim f(x) exists, but is # f(a), then f is said to have a removable dis- 
Xue Ch 

continuity at a. 

(a) If f(x) = sin 1 /& for x #0 and f(0) = 1, does f have apremovable 
discontinuity at 0? What if f(@) =xsin1/x for x 40, and f(O) = I? 

(b) Suppose f has a removable discontinuity at a. Let g(x) = f(x) for 
x #£a,and let g(a) = lim f(x). Prove that g is continuous at a. (Don’t 

x—a 

work very hard; this is quite easy.) 

(c) Let f(x) =0 if x is irrational, and let f(p/q) = 1/q if p/q is in lowest 
terms. What is the function g defined by g(x) = lim f(y)? 

yox 
*(d) Let f be a function with the property that every point of discontinuity 
is a removable discontinuity. “This means that lim f(y) exists for all x, 
you 
but f may be discontinuous at some (even infinitely many) numbers x. 
Define g(x) = hm f(y). Prove that g is continuous. (This is not quite 
es 


so easy as part (b).) 


**(e) Is there a function f which is discontinuous at every poimt, and which has 


only removable discontinuities? (It is worth thinking about this problem 
now, but mamily as a test of intuition; even if you suspect the correct 
answer, you will almost certainly be unable to prove it at the present 
time. Sec Problem 22-33.) 


CHAPTER THREE HARD THEOREMS 


‘This chapter is devoted to three theorems about continuous functions, and some 
of their consequences. ‘The proofs of the three theorems themselves will not be 
given until the next chapter, for reasons which are explained at the end of this 
chapter. 


THEOREM 1_ If f is continuous on [a,b] and f(a) < 0 < f(b), then there is some x in [a,b] 
suchsthritep (x) =O. 


(Geometrically, this means that the graph of a continuous function which starts 
below the horizontal axis and ends above it must cross this axis at some point, as 
in Figure 1.) 


THEOREM 2 If f is continuous on [a,b], then f is bounded above on [a, b], that is, there is 
some number WN such that f(x) < N for all x in [a, b]. 


(Geometrically, this theorem means that the graph of f lies below some line par- 
allel to the horizontal axis, as in Figure 2.) 


THEOREM 3 If f is continuous on |a,b], then there is some number y in [a,b] such that 
f£ = f@) forall x in [a, b] (Figure3). 


These three theorems differ markedly from the theorems of Chapter 6. The 
hypotheses of those theorems always involved continuity at a single point, while 


FN ee Tye ears 


PiIGURE I FIGURE 2 FIG WIRE 3 


FIGURE 4 


ERG URIESS 


FIGURE 6 


THEOREM 4 
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the hypotheses of the present theorems require continuity on a whole interval 
(a, b]| if continuity fails to hold at a single point, the conclusions may fail. For 
example, let f be the function shown in Figure 4, 


=], Nee gD 
be 


Then f is continuous at every point of [0,2] except J/2, ands (0), < Orme). 
but there is no point x in [0, 2] such that f(+) = 0; the discontinuity at the single 


f= 


point V2 is sufficient to destroy the conclusion of Theorem 1. 


Similarly, suppose that f is the function shown in Figure 5, 


laine Kes © 


te ing 8 che 0) 


Then f is continuous at every point of [0, 1] except 0, but f is not bounded above 
on [0, 1]. In fact, for any number N > 0 we have f(1/2N) =2N > N. 

This example also shows that the closed interval [a, b] in Theorem 2 cannot be 
replaced by the open interval (a, 6), for the function f is continuous on (0, 1), but 
is not bounded there. 


Finally, consider the function shown in Figure 6, 
se. S ’ 


On the interval [0,1] the function f 1s bounded above, so f does satisfy the 
conclusion of Theorem 2, even though f is not continuous on [0,1]. But f 
does not satisfy the conclusion of Theorem 3— there is no y in [0,1] such that 
f(y) = f(x) for all x in [0, 1]; im fact, it is certainly not true that f(1) = f(x) for 
all x in [0, 1] so we cannot choose y = 1, nor can we choose 0 < y < | because 
f(y) < f(@) if x is any number with y < x < |. 

This example shows that Theorem 3 is considerably stronger than Theorem 2. 
Theorem 3 is often paraphrased by saying that a continuous function on a closed 
interval “takes on its maximum value” on that interval. 

As a compensation for the stringency of the hypotheses of our three theorems, 
the conclusions are of a totally different order than those of previous theorems. 
They describe the behavior of a function, not just near a point, but on a whole in- 
terval; such “global” properties of a function are always significantly more difficult 
to prove than “local” properties, and are correspondingly of much greater power. 
‘To illustrate the usefulness of Theorems |, 2, and 3, we will soon deduce some im- 
portant consequences, but it will help to first mention some simple generalizations 
of these theorems. 


If f is continuous on [a,b] and f(a) < c < f(b), then there is some x im [a,b] 
suchthat fi(ay): «es 
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PROOF 


THEOREM 5 


PROOF 


THEOREM 6 


PROOF 


THEOREM 7 


PROOF 


THEOREM 8 


Let g = f —c. Then g is continuous, and g(a) < 0 < g(b). By Theorem 1, there 
is some x in [a,b] such that g(x) = 0. But this means that f(x) = c. J 


If f is continuous on [a,b] and f(a) > ¢ > f(b), then there is some x in [a,b] 
sucht that f@) = c. 


The function —f is continuous on [a,b] and — f(a) < -c < —f(b). By The- 
orem 4 there is some x in [a,b] such that —f(x) = —c, which means that 


faj=ecl 


Theorems 4 and 5 together show that f takes on any value between f(a) 
and f(b). We can do even better than this: if ¢ and d are in [a,b], then f 
takes on any value between f(c) and f(d). The proof is simple: if, for example, 
c < d, then just apply Theorems 4 and 5 to the interval [c,d]. Summarizing, if'a 
continuous function on an interval takes on two values, it takes on every value in 
between; this shght generalization of Theorem | is often called the Intermediate 
Value Theorem. 


If f is continuous on [a,b], then f is bounded below on [a, b], that is, there is 
some number N such that f(x) > N for all x in [a, b]. 


The function — f is continuous on [a,b], so by Theorem 2 there is a number M 
such that — f(x) < M for all x in [a, b]. But this means that f(x) => —M for all x 
in [a,b], so we can let N= —M. J 


Theorems 2 and 6 together show that a continuous function f on [a,b] is 
bounded on [a, b], that is, there is a number N such that | f(«)| < N for all x in 
[a,b]. In fact, since Theorem 2 ensures the existence of a number Ny such that 
f(x) < N, for all x in [a, b], and Theorem 6 ensures the existence of a number 
N> such that f(x) => No for all x mn [a, b], we can take N = max({Nj|. |N2|)- 


. 


If f is continuous on [a, b], then there is some y in [a,b] such that f(y) < f(x) 
for all x in [a, b]. 

(A continuous function on a closed interval takes on its minimum value on that 
interval.) 


The function —f is continuous on [a,b]; by Theorem 3 there is some y in [a,b] 
such that — f(y) => —f(x) for all x in [a,b], which means that f(y) < f(x) for 


all x in [a,b]. fj 


Now that we have derived the trivial consequences of ‘Vheorems 1, 2, and 3, we 


can begin proving a few miteresting things. 


very positive number has a square root. In other words, if a > O, then there is 
3) 
some number x such that +* =a. 


FIGURE 7 


PROOF 


THEOREM 9 


PROOF 
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Consider the function f(x) = x? 


, which is certainly continuous. Notice that the 
statement of the theorem can be expressed in terms of f: “the number @ has a 
square root” means that f takes on the value a. The proof of this fact about f 
will be an easy consequence of Theorem 4. 

There is obviously a number b > O such that f(b) > @ (as illustrated in Figure 7); 
m fact, f@ @ > we can take b = a, while if w@ < ® we cam take b =MlL_ Since 
f(O) <a < f(b), Theorem 4 applied to [0, b] implies that for some x (in [0, b}), 
we have f(x) =a, ie. x7 =a. J 


Precisely the same argument can be used to prove that a positive number has 
an nth root, for any natural number n. If n happens to be odd, one can do 
better: every number has an nth root. To prove this we just note that if the positive 
number a@ has the nth root x, .e., if x" =a, then (—x)” = —a (since n is odd), so 
—q has the nth root —x. The assertion, that for odd n any number @ has an ath 
root, is equivalent to the statement that the equation 


ee 0) 
has a root if n is odd. Expressed in this way the result 1s susceptible of great 
generalization. 
If n is odd, then any equation 
ao Gg mia othe + ap = 0 


has a root. 


We obviously want to consider the function 
F(X) =x" + ay_px" | +++ +49; 


we would like to prove that f is sometimes positive and sometimes negative. The 
intuitive idea is that for large |x], the function is very much like g(x) = x” and, 
since 1 is odd, this function is positive for large positive x and negative for large 
negative x. A little algebra is all we need to make this intuitive idea work. 

The proper analysis of the function f depends on writing 


an a 
f(x) =x" +g px" | $e Fag =x" (1 ale eee =). 
hs 


Note that 


An—1 | An-2 ao} — lan—1| ao 
Xx x? ie |x| ee 
Consequently, if we choose x satisfying 
() el 2H lan alee acer ag. 
then |x*| > |x| and 
lan—x| lana lan—k| 


l 
|x*| Ix! 2nla,—4| 2a’ 
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sO 
a) EG,» ay ] l 
ae SD ug ay te 
oe x? ae 2n pipes 
eee 
n terms 
In other words, 
| @,— a l 
ae | . Bal] eee 
> 4bs ee oP 
which implies that 
l Gia a 
ie = | ae n ] : - eo 
zZ x w 


Therefore, if we choose an x; > 0 which satisfies (*), then 


Gai)" 


ym 


%) ay 
x] Guns 


so that f(x;) > 0. On the other hand, if x2 < 0 satisfies (*), then (x2)” < 0 and 


(x2 " 


_ 


an 
=) ( = = ) = f(x), 


ae An—| ag 
> (x2) ( 4- feet ) = f(x), 


by) (4%)” 


so that f(x%o)e= 0. 
Now applying ‘Theorem | to the interval [.¥2..4)] we conclude that there is an x 
in [x2,.x;] such that f(x) = 0. ff 


Theorem 9 disposes of the problem of odd degree equations so happily that it 
would be frustrating to leave the problem of even degree equations completely 
undiscussed. At first sight, however, the problem seems insuperable. Some equa- 
tions, like x2 — 1 = 0, have a solution, and some, like x* + 1 = 0, do not— what 
more is there to say? If we are willing to consider a more general question, how- 
ever, something interesting can be said. Instead of trying to solve the equation 


a” EE ge ep See ee eee Oh 


let us ask about the possibility of solving the equations 


| 


xt Gee 8 4 eg = CC 


for all possible numbers c. ‘This amounts to allowing the constant term ag to vary. 
The information which can be given concerning the solution of these equations 
depends on a fact which is illustrated in Figure 8. 

Theeraph of the function () = x"--a,_ (x? l4...4+ag, with n even, contains, 
at least the way we have drawn it, a lowest point. In other words, there is a 
number y such that f(y) < f(x) for all numbers x— the function f takes on a 
minimum value, not just on each closed interval, but on the whole line. (Notice 
that this is false if 2 1s odd.) ‘The proof depends on Theorem 7, but a tricky 
application will be required. We can apply ‘Theorem 7 to any interval [a,b], and 
obtain a pot yo such that f(yo) is the minimum value of f on [a,b]; but if [a,b] 
happens to be the interval shown in Figure 8, for example, then the point yo will 
not be the place where f has its nunimum value for the whole line. In the next 
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theorem the entire poimt of the proof is to choose an interval [a,b] in such a way 
that this cannot happen. 


THEOREM 10 If 7 is even and f(x) = x" + a,-1x" | +-++++ ao, then there is a number y such 
that_ £0). = 60a) forall x. 
PROOF As in the proof of Theorem 9, if 
i See oe ee |e 2n|do|), 


then for all x with |x| => M, we have 
i an-| ao 
~ SP fef—. 
Z oe 
Since.n is even, «t= 0 for alll x, so 


set 
— sx" ($e t 
ps § 


provided that |x| => M. Now consider the number f(0). Let b > 0 be a number 
such that b” > 2 f(0) and also b > M. Then, if x => b, we have (Figure 9) 


oe = FO) 
ie 
pf Similarly, if x < —b, then 
5 i =/p) Hn Hh 
FIGURE 9 ule Re - = = 7) 


Summarizing: 


ie Dor, = —p, then f(*) = f (©). 


Now apply Theorem 7 to the function f on the interval [—b, b]. We conclude 
that there is a number y such that 


(1) i ore —'b, then f(y) = f@). 
In particular, f(y) < f(0). Thus 
(2) ieee) or x =p, then | (~¥) =f OF} = FO). 


Combining (1) and (2) we see that f(y) < f(x) for all x. J 


Theorem 10 now allows us to prove the following result. 


THEOREM 11 ~— Consider the equation 


| 


(>) etary 4: +g =, 
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G 
b 
m 


and suppose 1 1s even. ‘Then there is a number m such that (*) has a solution for 
c > m and has no solution for c < m. 


ena ht) =r ee een, (Figure 10). 

According to Theorem 10 there is a number y such that f(y) < f(x) for all x. 
Let m = f(y). If ¢ < m, then the equation (+) obviously has no solution, since 
the left side always has a value => m. If c = m, then (*) has y as a solution. 
Finally, suppose c > m. Let b be a number such that b > y and f(b) > c. Then 
f(y) =m <c < f(b). Consequently, by Theorem 4, there is some number x in 
[y, b] such that f(x) =c, so x is a solution of (x). J 


These consequences of Theorems 1, 2, and 3 are the only ones we will derive 
now (these theorems will play a fundamental role in everything we do later, how- 
ever). Only one task remains—to prove Theorems 1, 2, and 3. Unfortunately, 
we cannot hope to do this——on the basis of our present knowledge about the real 
numbers (namely, P1—P12) a proof is unpossible. There are several ways of con- 
vineing ourselves that this gloomy conclusion is actually the case. For example, 
the proof of Theorem 8 relies only on the proof of Theorem 1; if we could prove 
Theorem 1, then the proof of Theorem 8 would be complete, and we would have 
a proof that every positive number has a square root. As pomted out in Part I, it 
is impossible to prove this on the basis of P]~P12. Again, suppose we consider the 
function 


l 
fx) = are) 
If there were no number x with x? = 2, then f would be continuous, since the 
denominator would never = 0. But f is not bounded on [0,2]. So Theorem 2 
depends essentially on the existence of numbers other than rational numbers, and 
therefore on some property of the real numbers other than P1—P12. 

Despite our inability to prove Theorems |, 2, and 3, they are certainly results 
which we want to be true. If the pictures we have been drawing have any con- 
nection with the mathematics we are doing, if our notion of continuous function 
corresponds to any degree with our intuitive notion, Theorems |, 2, and 3 have 
got to be true. Since a proof of any of these theorems must require some new 
property of R which has so far been overlooked, our present difficulties suggest a 
way to discover that property: let us try to construct a proof of ‘Theorem 1, for 
example, and see what goes wrong, 

One idea which seems promising is to locate the first poimt where f (+) = 0, that 
is, the smallest x in [a,b] such that f(x) = 0. ‘To find this point, first consider 
the set A which contams all numbers x in [a,b] such that f is negative on [a,x]. 
In Figure 11, x is such a poimt, while x’ is not. ‘The set A itself is indicated by a 
heavy line, Since f is negative at a, and positive at b, the set A contams some 
points greater than a, while all points sufficiently close to b are not in A. (We are 
here using the continuity of f on [a,b], as well as Problem 6-16.) 


f(x) < 0 for all x in this interval 


A would also contain 
all these points 


PNG WR 2 


only this big 


FIGURE 13 


A could really be 


a 4 b 
f(x) > 0 for all x 


in this interval 
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Now suppose @ is the smallest number which is greater than all members of A; 
clearly a < a < b. We claim that f(a) = 0, and to prove this we only have to 
eliminate the possibilities f(a) <0 and f(a) > 0. 

Suppose first that f(a) < 0. Then, by Theorem 6-3, f(x) would be less than 0 
for all x ma small interval containing a, in particular for some numbers bigger 
than @ (Figure 12); but this contradicts the fact that @ is bigger than every member 
of A, since the larger numbers would also be in A. Consequently, f(a) < 0 is 
false. 

On the other hand, suppose f(@) > 0. Again applying Theorem 6-3, we see that 
f(x) would be positive for all x in a small interval containing q, in particular for 
some numbers smaller than @ (Figure 13). This means that these smaller numbers 
are all not in A. Consequently, one could have chosen an even smaller @ which 
would be greater than all members of A. Once again we have a contradiction; 
f(a) > Os also false. Hence f(a) = 0 and, we are tempted to say, Q.E.D. 

We know, however, that something must be wrong, since no new properties of R 
were ever used, and it does not require much scrutiny to find the dubious point. 
It is clear that we can choose a number a which is greater than all members of A 
(for example, we can choose a = 5), but it is not so clear that we can choose a 
smallest one. In fact, suppose A consists of all numbers x > 0 such that x? < 2. 
If the number V2 did not exist, there would not be a least number greater than 
all the members of A; for any y > V2 we chose, we could always choose a still 
smaller one. 

Now that we have discovered the fallacy, it is almost obvious what additional 
property of the real numbers we need. All we must do is say it properly and use it. 
That is the business of the next chapter. 


PROBLEMS 


1. Foreach of the following functions, decide which are bounded above or below 
on the indicated interval, and which take on their maximum or minimum 
value. (Notice that f might have these properties even if f is not continuous, 
and even if the interval is not a closed interval.) 


Qe foe x” once 1). 
ay =r = x? on (1,1). 
Gil) f(x) =x? on R. 

(iv) f(x) =x? on [0, 00). 


2) 
Do Asa : 
(Vi ee = on (—a— l,a+1). (We assume a > —'1, so 
| G+ 2,8 Sia 
that —a— 1 <a+1; it will be necessary to consider several possibilities 
for a.) 
e Y= a 
: A 5 7 € . 
(1) aaa = on [—a— l.a +1]. (Again assume a > —1.) 
Geen a >a , 
0. x irrational 


Cie i (@) = | on [0,1]. 


1/q. x = p/q m lowest terms 
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ds xX irrational 
OU dei | 1/q, x = p/q in lowest terms oes aE 
ie x irrational 
PECTS | —1/q, x = p/q in lowest terms °" Mey. 


; i Varational 
x %) = ae 

es slales) | Q, x irrational 
Ga f & = sin? (cos x + Va+a*) on (0, a), 

Gal) yfi(x) = [x )Jeons(0; a): 

For each of the following polynomial functions /, find an integer 7 such that 
f(x) =0 for some x between n and n+ 1. 


on [0, a]. 


fief a) = x? eae. 
Gi). f(x) = xP + 5xF 2x 21 
(ii) f(x) =x? +241. 

ivi et (se) = 4x? = 4y +1. 


Prove that there is some number x such that 

: 163 

(1) x19 4 —______ = 119, 
l+x-¢+sin* x 

Gi)messiney = 2% —el. 


This problem is a continuation of Problem 3-7. 


(a) If nm —k is even, and > 0, find a polynomial function of degree n with 
exactly & roots. 

(b) A root a of the polynomial function f is said to have multiplicity m 
if f(x) = (x — a)" g(x), where g is a polynomial function that does not 
have a as a root. Let f be a polynomial function of degree n. Suppose 
that f has k roots, counting multiplicities, 1e., suppose that & is the sum 
of the multiplicities of all the roots. Show that n — k ts even. 


Suppose that f is continuous on [a,b] and that f(x) is always rational. What 
can be said about f? 


Suppose that f ts a continuous function on {[—1, 1] such that x24 (f(x) = 
for all x. (This means that (x, f(¥)) always lies on the unit circle.) Show that 


either f(x) = V1 — x? for all x, or else f(x) = —V 1 — x? for all x. 


How many continuous functions f are there which satisfy (f(x))? = x? for 
allex? 

Suppose that f and g are continuous, that f? = g7, and that f(x) 4 0 for 
all x. Prove that cither f(x) = g(x) for all x, or else f(x) = —g(x) for all x. 


(a) Suppose that f is continuous, that f(*) = 0 only for x = a, and that 
f(x) > 0 for some x > a as well as for some x < a. What can be said 
about f(x) for all « 4.a? 


FIGURE 14 


(Tal) 


10. 


11. 


12. 


13. 


14. 


*E5. 


*16. 
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(b) Again assume that f is continuous and that f(x) = 0 only for x = a, 
but suppose, instead, that f(x) > 0 for some x > a and f(x) < 0 for 
some x < a. Now what can be said about f(x) for x 4 a? 

*(c) Discuss the sign of x3 +x7y +xy* + y? when x and y are not both 0. 


Suppose f and g are continuous on [a,b] and that f(a) < g(a), but f(b) > 
g(b). Prove that f(x) = g(x) for some x in [a,b]. (If your proof isn’t very 
short, it’s not the right one.) 


Suppose that f is a continuous function on [0, 1] and that f(x) is in [0, 1] 
for each x (draw a picture). Prove that f (+) = x for some number x. 


(a) Problem 11 shows that f intersects the diagonal of the square in Fig- 
ure 14 (solid line). Show that f must also intersect the other (dashed) 
diagonal. 

(b) Prove the following more general fact: If g is continuous on [0, 1] and 
e(0) 10, el) = Cor 20) =11, eC!) = 0, then (a) = ce) fomsamens 


(a) Let 7 (4) = simply: for x 4 0 and let f(0) = 0: Is f contmmnouston 
[—1. 1]? Show that f satisfies the conclusion of the Intermediate Value 
Theorem on [—1, 1]; in other words, if f takes on two values somewhere 
on {—1, 1], it also takes on every value in between. 

*(b) Suppose that f satisfies the conclusion of the Intermediate Value Theo- 
rem, and that f takes on each value only once. Prove that f is continuous. 

*(c) Generalize to the case where f takes on each value only finitely many 
times. 


If f is a continuous function on [0, 1], let || f|| be the maximum value of | f| 


on (0; !],. 


(a) Prove that for any number c we have ||cf|| = lel - fl. 
*(b) Prove that || f + gll < Il fll + Ilgll. Give an example where || f + g|| 4 
fll + Ilgll. 


(c) Wrovethate||2 — 7 || = ||4 — eil|-- 


i — alle 


Suppose that ¢ is continuous and lim ¢(x)/x" =O= lim (x)/x". 
Boo Xo, 9) eter 


(a) Prove that if n is odd, then there is a number x such that +" +(x) = 0. 
(b) Prove that rf ” is even, then there is a number y such that y" + ¢(y) < 
x" + d(x) for all x. 


Hint: Of which proofs does this problem test your understanding? 


(a) Suppose that f 1s continuous on (a,b) and lm f(x) = lm f(x) = oo. 
x—at v j 
Prove that f has a mmimium on all of (a,b). 


x—>b- 
(b) Prove the corresponding result when a = —oo and/or b = oo. 


Let f be any polynomial function. Prove that there ts some number y such 


that-(fA(y)< |G) |sfor-all x. 


132 Foundations 


*18. Suppose that f is a continuous function with f(x) > O for all x, and 


low @) = Cee low f(+)e (Draw a picsire:) Prove that there is some 
Kix eS > — OO 


number y such that f(y) => f(x) for all x. 
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(b) 


(d) 


Suppose that f is continuous on [a, b], and let x be any number. Prove 
that there is a point on the graph of f which is closest to (v,0); in 
other words there is some y in [a, b] such that the distance from (x, 0) 
to (y, f(y)) is < distance from (x, 0) to (z, f(z)) for all z in [a, b]. Gee 
Figure 15.) 

Show that this same assertion is not necessarily true if [a, b] is replaced 
by (a, b) throughout. 

Show that the assertion 7s true if [a, b] is replaced by R throughout. 

In cases (a) and (c), let g(x) be the minimum distance from (x, 0) to a 
point on the graph of f. Prove that g(y) < g(x) + |x — y|, and conclude 
that g is continuous. 

Prove that there are numbers x9 and x; in [a,b] such that the distance 
from (%p,0) to Cy, f (x%1)) see the distance trom (x, 0) to (4). f ae) 
for any xo’, x1’ m [a, bd]. 


Suppose that f is continuous on [0,1] and f(0) = f(1). Let n be any 
natural number. Prove that there is some number x such that f(x) = 
f(x+1/n), as shown in Figure 16 for n = 4. Hint: Consider the function 
g(x) = f(x) — f(x + 1/n); what would be true if g(x) 4 0 for all x? 
Suppose 0 < a < 1, but that a is not equal to 1/n for any natural 
number n. Find a function f which is continuous on [0, 1] and which 
satishes f(0) = f(1), but which does not satisfy f(x) = f(x + a) for 
any x. 


Prove that there does not exist a continuous function f defined on R 
which takes on every value exactly twice. Hint: If f(a) = f(b) for 
a <= DMihicnveither. f 4S (ator all x in (@& DY or fix) = Y@)Mer 
all x in (a,b). Why? In the first case all values close to f(a), but shghtly 
larger than f(a), are taken on somewhere in (a,b); this imphes that 
tobe ta) ior x < a ance 

Refine part (a) by proving that there is no contmuous function f which 
takes on each value either 0 times or 2 times, 1.e., which takes on exactly 
twice each value that it does take on. Hint: ‘The previous hint implies 
that f has either a maximum or a mimimum value (which must be taken 
on twice). What can be said about values close to the maximum value? 
Find a continuous function f which takes on every value exactly 3 mes. 
More generally, find one which takes on every value exactly n times, if 
nis odd. 

Prove that if nm is even, then there is no continuous f which takes on 
every value exactly n times. Hint: ‘To treat the case n = 4, for example, 
let faq) = f Cro) = flws) = fim). Phet Githtie f(m) = Onfor all ten 
two of the three intervals (x),.62), (42.43), (43. %4), or else f(x) < 0 for 
all x in two of these three intervals. 


CHAPTER 


DEFINITION 


DEFINITION 


LEAST WPPER BOUNDS 


‘This chapter reveals the most important property of the real numbers. Never- 
theless, it is merely a sequel to Chapter 7; the path which must be followed has 
already been indicated, and further discussion would be useless delay. 


A set A of real numbers is bounded above if there is a number x such that 


hy Salle [OTEVERYaaamny A 


Such a number x is called an upper bound for A. 


Obviously A is bounded above if and only if there is a number x which is an 
upper bound for A (and in this case there will be lots of upper bounds for A); we 
often say, as a concession to idiomatic English, that “A has an upper bound” when 
we mean that there is a number which is an upper bound for A. 

Notice that the term “bounded above” has now been used in two ways~ first, in 
Chapter 7, in reference to functions, and now in reference to sets. ‘Uhis dual usage 
should cause no confusion, since it will always be clear whether we are talking 
about a set of numbers or a function. Moreover, the two definitions are closely 
connected: if A is the set {f(x) : a < x < b}, then the function f is bounded 
above on [a, b] if and only if the set A is bounded above. 

The entire collection R of real numbers, and the natural numbers N, are both 
examples of sets which are not bounded above. An example of a set which zs 
bounded above is 


eae 1) a 


To show that A is bounded above we need only name some upper bound for A, 
which is easy enough; for example, 138 is an upper bound for A, and so are 2, 
15, 1+. and 1. Clearly, | is the least upper bound of A; although the phrase 
just introduced is self-explanatory, in order to avoid any possible confusion (in 
particular, to ensure that we all know what the superlative of “less” means), we 
define this explicitly. 


A number x is a least upper bound of A if 


(1) x is an upper bound of A, 


and (2) if y is an upper bound of A, then x < y. 
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The use of the indefinite article “a” m this definition was merely a concession 
to temporary ignorance. Now that we have made a precise definition, it is easily 


seen that if x and y are both least upper bounds of A, then x = y. Indeed, in this 


case 
x <y, since y is an upper bound, and x is a least upper bound, 
and y <x, since x is an upper bound, and y is a least upper bound; 

it follows that x = y. For this reason we speak of the least upper bound of A. 


‘The term supremum of A is synonymous and has one advantage. It abbreviates 
quite nicely to 
sup A (pronounced “soup A”’) 


and saves us from the abbreviation 
lub A 


(which is nevertheless used by some authors). 

There is a series of important definitions, analogous to those just given, which 
can now be treated more briefly. A set A of real numbers is bounded below if 
there is a number x such that 


x =1a for every a in A. 


Such a number x is called a lower bound for A. A number x 1s the greatest 
lower bound of A if 


(1) x is a lower bound of A, 
and (2) if y isa lower bound of A, then x > y. 


The greatest lower bound of A is also called the infimum of A, abbreviated 
inf A: 

some authors use the abbreviation 
glb A. 


One detail has been omitted from our discussion so far— the question of which 
sets have at least one, and hence exactly one, least upper bound or greatest lower 
bound. We will consider only least upper bounds, since the question for greatest 
lower bounds can then be answered easily (Problem 2). 

If A is not bounded above, then A has no upper bound at all, so A certainly 
cannot be expected to have a least upper bound. It is tempting to say that A does 
have a least upper bound if it has some upper bound, but, like the principle of 
mathematical induction, this assertion can fail to be true in a rather special way. 
If A = %, then A is bounded above. Indeed, any number x is an upper bound 
for 0: 


x >y for every y in Z 


simply because there is no y in J. Since every number is an upper bound for 9, 
there is surely no least upper bound for %. With this trivial exception however, 


FIGURE I 
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f(x) < 0 for all x 


in this interval 


FIGURE 2 
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our assertion is true~~-and very important, definitely important enough to warrant 
consideration of details. We are finally ready to state the last property of the real 
numbers which we need. 


(P13) (The least upper bound property) If A is a set of real numbers, 
A #94, and A is bounded above, then A has a least upper bound. 


Property P13 may strike you as anticlimactic, but that is actually one of its 
virtues. ‘To complete our list of basic properties for the real numbers we require no 
particularly abstruse proposition, but only a property so simple that we might feel 
foolish for having overlooked it. Of course, the least upper bound propcrty is not 
really so mnocent as all that; after all, it does not hold for the rational numbers Q. 
For example, if A is the set of all rational numbers x satisfying x* < 2, then there 
is no rational number y which is an upper bound for A and which is less than or 
equal to every other rational number which is an upper bound for A. It will become 
clear only gradually how significant P13 is, but we are already in a position to 
demonstrate its power, by supplying the proofs which were omitted in Chapter 7. 


If f is continuous on [a,b] and f(a) < 0 < f(b), then there is some number x 
in-|@, b)jisuchsthar f@) =@) 


Our proof is merely a rigorous version of the outline developed at the end of 
Chapter 7—we will locate the smallest number x in [a, b] with f(x) = 0. 
Define the set A, shown in Figure 1, as follows: 


A ={|x:a<x <b, and f is negative on the interval [a, x]}. 


Clearly A 4 ¥, since a is in A; in fact, there 1s some 6 > O such that A contains 
all points x satisfying a < x < a+; this follows from Problem 6-16, since f is 
continuous on [a,b] and f(a) < 0. Similarly, b is an upper bound for A and, m 
fact, there is a 6 > O such that all points x satisfying b — 6 < x < b are upper 
bounds for A; this also follows from Problem 6-16, since f(b) > 0. 

From these remarks it follows that A has a least upper bound @ and that 
a <a <b. We now wish to show that f(a) = 0, by eliminating the possibil- 
ities f(a) < Oand f(a) > O. 

Suppose first that f(@) < 0. By Theorem 6-3, there is a 5 > 0 such that 
f(x) < 0 for a —45 < x < a+ 6 (Figure 2). Now there is some number x9 in A 
which satisfies a — 6 < x9 < @ (because otherwise a would not be the least upper 
bound of A). ‘This means that f is negative on the whole interval [a, xo]. But if 
x, 1s anumber between @ and w + 4, then f is also negative on the whole interval 
[x9, x1]. Therefore f is negative on the interval [a,x;], so x, isim A. But this 
contradicts the fact that @ is an upper bound for A; our original assumption that 
f(a) <0 must be false. 

Suppose, on the other hand, that f(@) > 0. Then there ts a number 6 > 0 such 
that f(x) > 0 fora—6 <x <a+6 (Figure 3). Once again we know that there is 
an xg in A satisfying a — 6 < xg < a; but this means that f is negative on [a, xo], 
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which is impossible, since f(x9) > 0. Thus the assumption f(a) > 0 also leads to 
a contradiction, leaving f(@) = 0 as the only possible alternative. J 


The proofs of ‘Theorems 2 and 3 of Chapter 7 require a simple preliminary 
result, which will play much the same role as ‘Theorem 6-3 played in the previous 
proof. 

If f is continuous at a, then there is a number 6 > O such that f is bounded 
above on the interval (a — 5,a + 5) (see Figure 4). 


Since lim f(x) = f(a), there is, for every ¢ > 0, a 6 > O such that, for all x, 


if |x —a| < 6, then | f(x) — f(a)| < «. 


It is only necessary to apply this statement to some particular ¢ (any one will do), 
for example, ¢ = 1. We conclude that there is a 6 > 0 such that, for all x, 


if |v —a| <6, then | f(x) — f(a)| < 1. 


It follows, in particular, that if |x —a| < 6, then f(x) — f(a) < 1. ‘This completes 
the proof: on the interval (a — 6,a + 5) the function f is bounded above by 


flay+1. 


It should hardly be necessary to add that we can now also prove that f is 
bounded below on some interval (a — 6,a +4), and, finally, that f is bounded on 
some open interval containing a. 


A more significant pomt is the observation that if im, f(x) = f(a), then there 
ae f 


{+— 
a—-6d a até 
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is a 6 > O such that f is bounded on the set {x : a < x < a+}, and a similar 
observation holds if lm f(x) = f(b). Having made these observations (and 


y>b- 


assuming that you will supply the proofs), we tackle our second major theorem. 
If f is continuous on [a,b], then f is bounded above on [a, 5]. 


Let 
A= {x:a<x <band f is bounded above on [a, x]}. 


Clearly A 4 ¥ (since a is in A), and A is bounded above (by b), so A has a least 
upper bound a. Notice that we are here applying the term “bounded above” both 
to the set A, which can be visualized as lying on the horizontal axis, and to f, Le., 
to the sets {f(y): a < y < x}, which can be visualized as lying on the vertical axis 
(Figure 5). 

Our first step is to prove that we actually have a = b. Suppose, instead, that 
a <b. By Theorem | there is 5 > Osuch that f is bounded on (a—5,a+565). Since 
a is the least upper bound of A there is some xg in A satisfying a—5 < x9 <a. This 
means that f is bounded on [a, x9]. But if xy) 1s any number with a < x7 <a+6, 
then f is also bounded on [xo, x]. Therefore f is bounded on [a, x1], so x1 is 
in A, contradicting the fact that @ is an upper bound for A. This contradiction 
shows that a = b. One detail should be mentioned: this demonstration implicitly 
assumed that a < @ [so that f would be defined on some interval (@ — 5,a + 5)]; 
the possibility a = @ can be ruled out similarly, using the existence of a 5 > O such 
that f is bounded on {x :a <x <a+6}. 

The proof is not quite complete—we only know that f is bounded on [a. x] for 
every x < b, not necessarily that f is bounded on [a,b]. However, only one small 
argument needs to be added. 

There is a 6 > O such that f ts bounded on {x : b—6 < x < b}. There is x9 
in A such that b— 6 < x9 < b. Thus f is bounded on [a@, xo| and also on [.xo, 5], 
so f is bounded on [a,b]. § 


To prove the third important theorem we resort to a trick. 


If f is continuous on [a,b], then there ts a number y in [a,b] such that f(y) = 


f (x) for all x in [a, bd]. 


We already know that f is bounded on [a,b], which means that the set 
{f@) -x m [a. b]} 


is bounded. This set is obviously not 4, so it has a least upper bound a. Since 
a > f(x) for x in [a,b] it suffices to show that @ = f(y) for some y in [a,b]. 

Suppose instead that a 4 f(y) for all ym [a,b]. Then the function ¢ defined 
by 


(x) = ————, _ xmfa.b 
g eT [a. D| 


138 Foundations 


THEOREM 2 


PROOF 


is continuous on [a, b}, since the denominator of the right side is never 0. On the 
other hand, @ is the least upper bound of { f(x) : x in [a, 5] }; this means that 


for every ¢ > O there ISX in) |@. | wah = of (x) <_€. 
This, in turn, means that 
for every ¢ > 0 there is x in [a, b] with g(x) > I/e. 


But this means that g is not bounded on [a,b], contradicting the previous theo- 
rem. f 


At the beginning of this chapter the set of natural numbers N was given as an 
example of an unbounded set. We are now going to prove that N is unbounded. 
After the difficult theorems proved in this chapter you may be startled to find 
such an “obvious” theorem winding up our proceedings. Ifso, you are, perhaps, 
allowing the geometrical picture of R to influence you too strongly. “Look,” you 
may say, “the real numbers look hike 


—— aS 
0 | 2 3) ie x =A 


so every number x is between two integers n,n + | (unless x is itself an mteger).” 
Basing the argument on a geometric picture is not a proof, however, and even the 
geometric picture contains an assumption: that if you place unit segments end-to- 
end you will eventually get a segment larger than any given segment. This axiom, 
often omitted from a first introduction to geometry, is usually attributed (not quite 
justly) to Archimedes, and the corresponding property for numbers, that N is not 
bounded, is called the Archimedean property of the real numbers. ‘This property 1s noé 
a consequence of PI P12 (see reference [14] of the Suggested Reading), although 
it does hold for Q, of course. Once we have P13 however, there are no longer 
any problems. 


N is not bounded above. 


Suppose N were bounded above. Since N ¥ ¥, there would be a least upper 
bound @ for N. ‘Then 


a>n for all n nN. 


Consequently, 


a>n+1 forallnmN, 
since a+ Lis in N if nis im N. But this means that 
a—-d>m forall fin N, 


and this means that a — | is also an upper bound for N, coutradicting the fact that 
a is the least upper bound. 


THEOREM 3 


PROOF 
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There is a consequence of Theorem 2 (actually an equivalent formulation) which 
we have very often assumed implicitly. 


For any € > O there is a natural number nv with I/n < e,. 


Suppose not; then I/n > e€ for all nin N. Thus» < 1/e for all nin N. But this 
means that }/¢ is an upper bound for N, contradicting Theorem 2. § 


A brief glance through Chapter 6 will show you that the result of Theorem 3 
was used in the discussion of many examples. Of course, Theorem 3 was not 
available at the time, but the examples were so important that in order to give 
them some cheating was tolerated. As partial justification for this dishonesty we 
can claim that this result was never used in the proof of a ¢heorem, but if your faith 
has been shaken, a review of all the proofs given so far is in order. Fortunately, 
such deception will not be necessary again. We have now stated every property of 
the real numbers that we will ever need. Henceforth, no more hes. 


PROBLEMS 


1. Find the least upper bound and the greatest lower bound (if they exist) of 
the following sets. Also decide which sets have greatest and least elements 
(i.e., decide when the least upper bound and greatest lower bound happens 
to belong to the set). 


(1) {os in NI 


i 


| 
(ii) Fs >nin Zandn FZ of. 


iH 


(iit) [4 % = Osorno / for some 4 iN}. 
(iw) xg Oe x = J/2 and x is rational}. 

Oo ete: Keck & tel =O). 

(vi) (bere? +x = 1 <0}. 

(vit (oe = Omid va — 1 =O) 


(viii) Fs +(-—1)":nin N}. 
2. (a) Suppose A # QM is bounded below. Let —A denote the set of all —x 
for x in A. Prove that —A 4 %, that —A is bounded above, and that 

— sup(—A) 1s the greatest lower bound of A. 
(b) If A # ¥1s bounded below, let B be the set of all lower bounds of A. 
Show that B 4 ¥, that B is bounded above, and that sup B 1s the greatest 


lower bound of A. 
3. Let f be a contmuous function on [a,b] with f(a) <0 < f(b). 
(a) “Phe proof of Vheorem 7-1 showed that there is a smallest x in [a. b| 
with f(x) = 0.slf there 1s more than one-x in [a,b] wath f(x) = 0, 
is there necessarily a second smallest? Show that there is a largest x in 
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[a,b] with f(x) = 0. (Try to give an easy proof by considering a new 
function closely related to f.) 

(b) ‘The proof of Theorem 7-1 depended upon considering A = {x : a < 
x < band f is negative on [a, x]}. Give another proof of Theorem 7-1, 
which depends upon consideration of B = {x (a =x = bande 
O}. Which point + in [a,b] with f(+) = 0 will this proof locate? Give 
an example where the sets A and B are not the same. 


(a) Suppose that f is continuous on [a,b] and that f(a) = f(b) = 0. 
Suppose also that f(x%9) > 0 for some xo in [a,b]. Prove that there are 
mamibers ¢ and) d witha =<¢ <x <d <= b suchainat f(c) = A(d)=—VO. 
but f(x) > 0 for all x m (c,d). Hint: The previous problem can be used 
to good advantage. 

(b) Suppose that f ts continuous on [a, b] and that f(a) < f(b). Prove that 
fhetecane numbers.c and d with a@ <c <d = b sughsthat fc) Alay 


AUGiatde =) (Oy and fiai= f(x) = 7d) lor all yetaleed), 


(a) Suppose that y — x > 1. Prove that there is an integer k such that 
a <k < y. Hmt: Let d be the largest integer satisfying / < xfand 
consider / + I. 

(b) Suppose x < y. Prove that there is a rational number r such that x < 
r<y. Hint: If 1/n < y—x, thenny—nx > 1. (Query: Why have parts 
(a) and (b) been postponed until this problem set?) 

(c) Suppose that r < s are rational numbers. Prove that there is an irrational 
number between r and s. Hint: As a start, you know that there is an 
irrational number between 0 and 1. 

(d) Suppose that x < y. Prove that there is an irrational number between x 
and y. Hint: It is unnecessary to do any more work; this follows from 


(b) and (c). 


A set A of real numbers is said to be dense if every open interval contains a 
point of A. For example, Problem 5 shows that the set of rational numbers 
and the set of irrational numbers are each dense. 


(a) Prove that if f is contnuous and f(x) = 0 for all numbers x in a dense 
Sete then Ff (x) = OMor allex: 

(b) Prove that if f and g are contmuous and f(x) = g(x) for all x ina dense 
stt “Ae tacuey (y= ery tor all x 

(c) Ifwe assume instead that f(x) > g(x) for all x in A, show that f(x) = 


g(x) for all x. Can > be replaced by > throughout? 


Prove that if f is continuous and f(x + vy) = f(x) + fQ’) for all x and y, 
then theré%is a ntimber. ¢ such that (7) = cw for all a> (his conclusion 
can be demonstrated simply by combining the results of two previous prob- 
lems.) Pomt of information: ‘There do exist noncontinuous functions f satisfying 
f(x ty) = f(x) + f(y) for all x and y, but we cannot prove this now; In 
fact, this simple question mvolves ideas that are usually never mentioned in 
undergraduate courses (see reference [7] in the Suggested Reading). 
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*8. Suppose that f is a function such that f(a) < f(b) whenever a < b (Fig- 
ure 6). 


a) Prove that lim f(x) and lim f(x) both exist. Hint: Why is this prob- 


A 1) T x—a 
lem m this chapter? 
b) Prove that f never has a removable discontinuity (this termimology comes 
from Problem 6-17). 


c) Prove that if f satisfies the conclusions of the Intermediate Value ‘The- 


orem, then f is continuous. 


*9. lf fms a boundeds function om [0; 1), let alg ||| = sup{ LF Gol = am (0, |}. 


Prove analogues of the properties of |] || in Problem 7-14. 


10. Suppose a > 0. Prove that every number x can be written uniquely in the 
form x = ka +x’, where k is an integer, and O < x’ <a. 


11. (a) Suppose that ay, a2,a3,... 18 a sequence of positive numbers with 
Qn41 < 4,/2. Prove that for any ¢ > 0 there is some n with a, < €. 


FIGURE 6 


(b) Suppose P is a regular polygon inscribed inside a circle. If P’ is the 
inscribed regular polygon with twice as many sides, show that the differ- 
ence between the area of the circle and the area of P’ is less than half the 
difference between the area of the circle and the area of P (use Figure 7). 

(c) Prove that there is a regular polygon P inscribed in a circle with area 
as close as desired to the area of the circle. In order to do part (c) you 
will need part (a). This was clear to the Greeks, who used part (a) as the 
basis for their entire treatment of proportion and area. By calculating 
the areas of polygons, this method (“the method of exhaustion”) allows 
computations of z to any desired accuracy; Archimedes used it to show 


Nae a) : 5 ; : 
that 4> < 7 < =. But it has far greater theoretical importance: 


one side of P 


sides of P’ 


*(d) Using the fact that the areas of two regular polygons with the same num- 

ber of sides have the same ratio as the square of their sides, prove that the 
euRE? areas of two circles have the same ratios as the square of their radii. Hint: 
Deduce a contradiction from the assumption that the ratio of the areas 
is greater, or less, than the ratio of the square of the radu by inscribing 
appropriate polygons. 


12. Suppose that A and B are two nonempty sets of numbers such that ~ < y 
for all x in A and all y in B. 


(a) Prove that sup A < y for all y in B. 
(b) Prove that sup A < inf B. 


13. Let A and B be two nonempty sets of numbers which are bounded above, and 
let A+B denote the set of all numbers x+y with x m A and y in B. Prove that 
sup(A+B) = sup A+sup B. Hint: ‘The mequality sup(A+B) < sup A+sup B 
is casy. Why? ‘To prove that sup A + sup B < sup(A + B) it suffices to prove 
that sup A + sup B < sup(A + B)+ € for all e > 0; begin by choosing x. in A 
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The simple result of Problem 14(a) is called the 


aud y in B with sup A — x < €/2 and sup B — y < e/2. 
CURE (a—_—<———-|—> + k aeons] 


(a) Gonsidemassequenee of closedsintenvals fj) = (ays by), lo =all@aeba lea. 


Suppose that ay < dyj41 and b,4,; < b, for all n (Figure 8). Prove that 
there is a point x which is m every Jy. 

(b) Show that this conclusion is false if we consider open intervals instead of 
closed intervals. 


6 


Nested Interval ‘Theorem.” It 


may be used to give alternative proofs of Theorems | and 2. ‘The appropriate 


reasoning, outlined in the next two problems, illustrates a general method, called 


a “bisection argument.” 


=D .. 


eL6t 


ey: 


“1S, 


Suppose f is continuous on [a,b] and f(a) < 0 < f(b). Then either 
f((a + b)/2) = 0, or f has different signs at the end points of the interval 
[a, (a + b)/2], or f has different signs at the end points of [(a + b)/2,b]. 
Why? If f((a + b)/2) 4 0, let 1; be the interval on which f changes sign. 
Now bisect J). Either f is 0 at the midpomt, or f changes sign on one of the 
two intervals. Let /> be that interval. Continue in this way, to define /, for 
each n (unless f is 0 at some midpoint). Use the Nested Interval Theorem 
to find a point x where f(x) = 0. 


Suppose f were continuous on [a,b], but not bounded on [a,b]. Then f 
would be unbounded on either [a, (a+)/2] or |(a+b)/2, b]. Why? Let /; 
be one of these intervals on which f 1s unbounded. Proceed as in Problem 15 
to obtain a contradiction. 


(a) Let A= {x :x <a}. Prove the following (they are all easy): 


Ge Ti x isan A andewcd xyatheneyers mA 


(ine aa). 
Gi) AZAR. 
(iv) If x is in A, then there is some number x’ in A such that x < x’. 
(b) Suppose, conversely, that A satisfies (i)-(iv). Prove that A = {x : x < 
sup A}. 


A number x is called an almost upper bound for A if there are only 
finitely many numbers y in A with y > x. An almost lower bound is 
defined siniilarly. 


(a) Find all almost upper bounds and almost lower bounds of the sets im 
Problem 1. 

(b) Suppose that A is a bounded infinite set. Prove that the set B of all 
almost upper bounds of A is nonempty, and bounded below. 


ales 


20. 
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(c) It follows from part (b) that inf B exists; this number is called the limit 
superior of A, and denoted by lim A or lim sup A. Find lim A for each 
set A in Problem 1. 

(d) Define lim A, and find it for all A in Problem 1. 


If A is a bounded infinite set prove 
(a) lim A <limA. 


(b) lim A < sup A. 


(c) If lim A <supA, then A contains a largest element. 
(d) The analogues of parts (b) and (c) for lim. 


ee be a 


shadow points 


FIGURE 9 


Let f be a continuous function on R. A point x 1s called a shadow point 
ot f if thereus » number vy > + with f(y) > f@). The rationnlesor this 
terminology is mdicated in Figure 9; the parallel lines are the rays of the sun 
rising m the east (you are facing north). Suppose that all points of (a,b) are 
shadow pomts, but that a and b are not shadow pomts. Clearly, f(a) > f(b). 


(a) Suppose that f(a) > f(b). Show that the point where f takes on its 
maximum value on [a,b] must be a. 
(b) ‘Phen show that this leads to a contradiction, so that in fact we must have 


f(a) = f(b). 


This little result, known as the Rising Sun Lemma, is instrumental m 
proving several beautiful theorems that do not appear im this book; see 


page 450. 
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DEFINITION 


APPENDIX. UNIFORM CONTINUITY 


Now that we’ve come to the end of the “foundations,” it night be appropriate 
to slip in one further fundamental concept. ‘This notion is not used crucially in 
the rest of the book, but it can help clarify many poimts later on. 

We know that the function f(x) = x? is continuous at a for all a. In other 
words, 


if a is any number, then for every € > 0 there is some 6 > 0 
~ . 9} 5), 
such that, for all x, if |x —a| < 5, then |x* —a-| <e. 


Of course, 6 depends on ¢. But 6 also depends on athe 6 that works at a might 
not work at b (Figure 1). Indeed, it’s clear that given ¢ > 0 there is no one 6 > 0 
that works for all a, or even for all positive a. In fact, the number a + 6/2 will 
certainly satisfy |x — a| < 6, but if a > 0, then 


ey 2\= a > ad 
ss @~ | = |e i as ee 


and this won't be < ¢ once a > €/8. (This is just an admittedly confusmg compu- 
tational way of saying that f is growing faster and faster!) 

On the other hand, for any ¢ > 0 there will be one 5 > 0 that works for all a 
in any interval [—N, N]. In fact, the 6 which works at N or —N will also work 
everywhere else in the interval. 

As a final example, consider the function f(x) = sin 1/x, or the function whose 
eraph appears in Figure 18 on page 62. It is easy to see that, so long as € < i 
there will not be one 5 > 0 that works for these functions at all points a m the 
open interval (0, 1). 

These examples illustrate important distinctions between the behavior of various 
continuous functions on certain intervals, and there is a special term to signal this 
distinction. 


The function f is uniformly continuous onan interval A if for every ¢ > 0 
there is some 6 > O such that, for all x and y in A, 


it (vty | es Ontienmi Gy y CY) IP— ie. 


We've seen that a function can be continuous on the whole line, or on an open 


interval, without being uniformly continuous there. On the other hand, the func- 
fiom (0) — x? did turn out to be uniformly continuous on any closed interval. 
This shouldn’t be too surprising it’s the same sort of thing that occurs when we 
ask whether a function is bounded on an interval aud we would be led to suspect 
that any continuous function on a closed interval is also uniformly continuous on 
that interval. In order to prove this, we'll need to deal first with one subtle point. 

Suppose that we have two intervals [a,b] and [P. ce] with the common end- 
point b, and a function f that is continuous on [a,c]. Let & > 0 and suppose that 


<6 
er , 
oo” 
a i b y G 


EG, CURE 2 


LEMMA 


PROOF 


THEOREM 1 


PROOF 
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the following two statements hold: 


(1) if x and y are in [a, b| and |x — y| < 4), then |f(x)— f(y) <e«, 
(11) if mand y are im [b,c] and |x — yj— 65. then if @) Saat ee: 


We'd like to know if there is some 6 > 0 such that | f(x) — f(y)| < & whenever 
x and y are points in [a,c] with |x — y| < 6. Our first inclination might be to 
choose 6 as the minimum of 6; and 42. But it 1s easy to see what goes wrong 
(Figure 2): we might have x in [a,b] and y in [b,c], and then neither (1) nor (1) 
tells us anything about | f(x) — f(y)|. So we have to be a little more cagey, and 
also use continuity of f at b. 


Let a < b < c and let f be continuous on the interval [a,c]. Let ¢ > 0, and 
suppose that statements (i) and (1) hold. Then there is a 6 > O such that, 


if x and y are in [a,c] and |x — y| < 6, then | f(x) — f(Q)| < «. 


Since f is continuous at b, there is a 63 > O such that, 


i {x — b| < d3, thens| f(#)— / ©)| <= >. 


It follows that 
(mi) if |x —b| < 63 and |y — b| < 43, then | f(x) — fQ)| <e. 


Choose 6 to be the minimum of 4), 62, and 463. We claim that this 6 works. In 
fact, suppose that x and y are any two points in [a,c] with |x — y| < 6. If x and y 
are both in [a, b], then | f(x) — f(y)| < € by (1); and if x and y are both in [b,c], 
then | f(x) — f(y)| < & by (i). The only other possibility is that 


eerie ay or ere @ 
In either case, since |x — y| < 6, we also have |x — b| < 6 and |y — b| < 6. So 
LAC) — fOr 2 by Gu). J 


If f is continuous on [a, b], then f is uniformly continuous on [a, b]. 


It’s the usual trick, but we've got to be a little bit careful about the mechanism of 
the proof. For e > 0 let’s say that f is e-good on [a, b| if there is some 6 > 0 such 
that, for all y and z in [a, bd], 
if |y —z| < 6, then |f(y) — f(s)| <.e. 
Then we're trying to prove that f is e-good on [a,b] for all ¢ > 0. 
Consider any particular ¢ > 0. Let 
A={x:a<x <band f is €-good on [a. x]}. 


Then A # & (since a is in A), and A is bounded above (by b), so A has a least 
upper bound @. We really should write @,, since A and @ might depend on ¢. But 
we won't since we intend to prove that @ = b, no matter what ¢ is. 
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Suppose that we had @ < b. Since f is continuous at a, there is some dg > O 
such that, if |y —a@| < do, then | f(y) — f(@)| < €/2. Consequently, if |y —a| < d9 
and |z —a@| < do, then | f(y) — f(z)| < ¢. So f is surely ¢-good on the interval 
[~ — d9.a@ + d9]. On the other hand, since @ is the least upper bound of A, it 
is also clear that f is e-good on [a,@ — do]. Then the Lemma implies that f is 
e-good on [a.a@ + do], so a + dg 1s in A, contradicting the fact that @ is an upper 
bound. 

To complete the proof we just have to show that @ = b is actually in A. ‘The 
argument for this 1s practically the same: Since f 1s continuous at b, there is some 
d9 > 0 such that, if b — 69 < y < b, then | f(y) — f(b)| < €/2. So f is €-good on 
[b — d9, b]. But f is also e-good on [a,b — dg], so the Lemma implies that f is 
é-good on [a,b]. §f 


PROBLEMS 


® uni- 


1. (a) For which of the followmg values of @ is the function f(x) = x 
formly contimimous on [0;.co); a =eh/35 1/2, 2, 3? 
(b) Find a function f that is contmuous and bounded on (0, 1], but not 
uniformly continuous on (Q, I]. 
(c) Find a function f that is continuous and bounded on [Q, 00) but which 


is not uniformly continuous on [0, 00). 


2. (a) Prove that if f and g are uniformly continuous on A, then so is f + g. 
(b) Prove that if f and g are uniformly continuous and bounded on A, then 
fg is uniformly continuous on A. 
(c) Show that this conclusion does not hold if one of them isn’t bounded. 
(d) Suppose that f is uniformly continuous on A, that g is uniformly con- 
tinuous on B, and that f(x) is in B for all x in A. Prove that go f is 
uniformly continuous on A. 


3. Use a “bisection argument” (page 142) to give another proof of Theorem I. 


4. Derive Theorem 7-2 as a consequence of Theorem 1. 
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In 1604, at the height of 

his scientific career, Galileo argued 
that for a rectilinear motion 

in which speed increases proportionally 
to distance covered, 

the law of motion should be 

est Wiat (% =i) 

which he had discovered 

in the investigation of falling bodies. 
Between 1695 and 1700 

not a single one of the monthly issues 
of Leipzig’s Acta Eruditorum was published 
without articles of Leibniz, 

the Bernoull: brothers 

or the Marquis de (Hopital treating, 
with notation only slightly different from 
that which we use today, 

the most varied problems of 
differential calculus, integral calculus 
and the calculus of variations. 

Thus in the space of almost precisely 
one century 

infinitesimal calculus or, 

as we now call it in English, 

The Calculus, 

the calculating tool par excellence, 

had been forged; 

and nearly three centuries of 

constant use have not completely dulled 
this incomparable instrument. 


NICHOLAS BOURBAKI 
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CHAPTER 


DERIVATIVES 


The derivative of a function ts the first of the two major concepts of this section. 
Together with the mtegral, it constitutes the source from which calculus derives 
its particular flavor. While it is true that the concept of a function 1s fundamental, 
that you cannot do anything without limits or continuity, and that least upper 
bounds are essential, everything we have done until now has been preparation-—if 
adequate, this section will be easier than the preceding ones— for the really exciting 
ideas to come, the powerful concepts that are truly characteristic of calculus. 

Perhaps (some would say “certainly”) the interest of the ideas to be introduced 
in this section stems from the timate connection between the mathematical con- 
cepts and certain physical ideas. Many definitions, and even some theorems, may 
be described in terms of physical problems, often im a revealing way. In fact, the 
demands of physics were the original mspiration for these fundamental ideas of 
calculus, and we shall frequently mention the physical interpretations. But we 
shall always first define the ideas in precise mathematical form, and discuss their 
significance in terms of mathematical problems. 

The collection of all functions exhibits such diversity that there is almost no 
hope of discovermg any interesting general properties pertaining to all. Because 
continuous functions form such a restricted class, we might expect to find some 
nontrivial theorems pertaining to them, and the sudden abundance of theorems 
afier Chapter 6 shows that this expectation 1s justified. But the most interesting 
and most powerful results about functions will be obtained only when we restrict 
our attention even further, to functions which have even greater claim to be called 
“reasonable,” which are even better behaved than most continuous functions. 


f(x) = |x|, x 20 
fQy= x*,x <0 
f(x) = |x| 
f(x) = vir 
FIGURE | (a) (b) (c) 


Figure | illustrates certam types of misbehavior which continuous functions can 
display. The graphs of these functions are “bent” at (0,0), unlike the graph of 
igure 2, where it is possible to draw a “tangent line” at each point. The quotation 
marks have been used to avoid the suggestion that we have defined “bent” or 
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“tangent line,” although we are suggesting that the graph might be “bent” at a 
pomt where a “tangent line” cannot be drawn. You have probably already noticed 
that a tangent line cannot be defined as a line which intersects the graph only 
once—such a definiuon would be both too restrictive and too permissive. With 


such a definition, the straight line shown in Figure 3 would not be a tangent line 
to the graph in that picture, while the parabola would have two tangent lines at 
each point (Figure 4), and the three functions in Figure 5 would have more than 
FIGURE 3 one tangent line at the points where they are “bent.” 


(a) (b) (c) 
FIGURE 5 
PRC ORE A A more promising approach to the definition of a tangent line might start with 


“secant lines,” and use the notion of limits. If 4 4 0, then the two distinct points 
(a, f(a)) and (a +h, f(a +h)) determine, as in Figure 6, a straight line whose 
slope 1s 
flat+h) — f(a) 
h 


(ath, f(a+h)) 


flat+h)— fa 


(a, f(a)) 


FIGURE 6 


As Figure 7 illustrates, the “tangent line” at (a, f(@)) seems to be the limit, m 
some sense, of these “secant lines,” as ft approaches 0. We have never before 
talked about a “Hnut” of lines, but we can talk about the limit of their slopes: the 


DEFINITION 
A 
(a, f(a)) 
ESSESEI&I2I]EE EEE 
a 
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slope of the tangent line through (a. f(a)) should be 
mm Lia 2) — 7) 
lin. ———__—_—_ 
h—0 h 


We are ready for a definition, and some comments. 


The function f is differentiable at a if 


. fla+h)— fla) 
im —————_ 


exists. 
h-0 h 


In this case the limit is denoted by f’(a) and is called the derivative of f at 
a. (We also say that f is differentiable if f is differentiable at a for every a 
in the domain of f/f.) 


The first comment on our definition is really an addendum; we define the 
tangent line to the graph of f at (a, f(a)) to be the line through (a, f(a)) 
with slope f’(a). This means that the tangent line at (a, f(a)) is defined only if 
f is differentiable at a. 

The second comment refers to notation. “‘Vhe symbol f’(a) is certainly rem- 
iniscent of functional notation. In fact, for any function f, we denote by f’the 
function whose domain is the set of all numbers a such that f is differentiable 
at a, and whose value at such a number a is 


. flat+h)— fa) 
i, ——_— 
h-0 h 


(Io be very precise: f’ is the collection of all pairs 


( . flath)— a) 
(GL. lim (a 


h=0 h 
for which lim | f(a+h)— f(a)]/h exists.) The function f’ is called the derivative 


h—0 
of f. 

Our third comment, somewhat longer than the previous two, refers to the phys- 
ical interpretation of the derivative. Consider a particle which is moving along a 
straight line (Figure 8(a)) on which we have chosen an “origin” point O, and a 
direction in which distances from O shall be written as positive numbers, the dis- 
tance from O of points in the other direction bemg written as negative numbers. 
Let s(t) denote the distance of the particle from O, at time rt. ‘The suggestive nota- 
tion s(t) has been chosen purposely; since a distance s(t) is determined for each 


; : if = 0) i — | 
motion of the particle 
fo PF '4 1=3 yr=2 
+++ | 


line along which particle is moving 
FIGURE 8(a) 
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“distance” 


FIGURE 8(b) 


“time” 


graph of s 


number ¢, the physical situation automatically supplies us with a certain function s. 
The graph of s indicates the distance of the particle from O, on the vertical axis, 
in terms of the time, indicated on the horizontal axis (Figure 8(b)). 
The quotient 
s(at+h) — s(a) 
h 


has a natural physical interpretation. It is the “average velocity” of the particle 
during the time interval from a to a +h. For any particular a, this average speed 
depends on hf, of course. On the other hand, the limit 


. Ss(ath) —s(a) 
hing —— 
h-0O h 


depends only on a (as well as the particular function s) and there are important 
physical reasons for considering this limit. We would like to speak of the “velocity 
of the particle at time a,” but the usual definition of velocity is really a definition 
of average velocity; the only reasonable definition of “velocity at time a” (so-called 
“instantaneous velocity”) is the limit 


. sath) —s(a) 
0 
h—0O h 


Thus we define the (instantaneous) velocity of the particle at a to be s’(a). 
Notice that s’(a) could easily be negative; the absolute value |s’(a)| 1s sometimes 
called the (instantaneous) speed. 

It is important to realize that mstantaneous velocity is a theoretical concept, 
an abstraction which does not correspond precisely to any observable quantity. 
While it would not be fair to say that instantaneous velocity has nothing to do 
with average velocity, remember that s(t) 1s not 


s(t +h) —s(t) 
h 


for any particular fh, but merely the limit of these average velocities as fh ap- 
proaches 0. ‘Thus, when velocities are measured in physics, what a physicist really 
measures is an average velocity over some (very small) time interval; such a pro- 
cedure cannot be expected to give an exact answer, but this is really no defect, 
because physical measurements can never be exact anyway. 

The velocity ofa particle is often called the “rate of change of its position.” ‘This 
notion of the derivative, as a rate of change, applies to any other physical situation 


» 


in which some quantity varies with time. For example, the “rate of change of 
mass” of a growmeg object means the derivative of the function m, where m(t) 1s 
the mass at time f. 

In order to become familiar with the basic definitions of this chapter, we will 
spend quite some time examining the derivatives of particular functions. Before 
proving the important theoretical results of Chapter 11, we want to have a good 
idea of what the derivative of a function looks like. “Phe next chapter ts devoted 
exclusively to one aspect of this problem calculating the derivative of compli- 
cated functions. In this chapter we will emphasize the concepts, rather than the 


PEG RE 9 


slope 2a 


(a,a°) 
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calculations, by considermg a few simple examples. Simplest of all is a constant 
function, f («) =c.sin this ¢ase 

flat+h) — fla) Se 
a 


hi = lim 
h—0O h ho0 h 


=). 


Thus f is differentiable at a for every number a, and f’(a) = 0. This means that 
the tangent line to the graph of f always has slope 0, so the tangent line always 
coincides with the graph. 

Constant functions are not the only ones whose graphs coincide with their tan- 
gent lines—this happens for any linear function f (+) = cx + d. Indeed 


; _™. flath) — f(a) 
i tay= him — 


1-0 h 
.. e(a@ +h) +d — [ca yd] 
= loa == — = 
h-0 h 
Loch 
= himé—= =c: 
oO) if 


the slope of the tangent line is c, the same as the slope of the graph of f. 
A refreshing difference occurs for f(x) = x7. Here 


7 Gay — lim 
h0 


h>0 h 
— at +2ah +h? — a? 
= 
h—>0 h 
= lim 2a +h 


h—> 


La 


Some of the tangent lines to the graph of f are shown in Figure 9. In this picture 
each tangent line appears to intersect the graph only once, and this fact can be 
checked fairly easily: Since the tangent line through (a, a7) has slope 2a, it is the 
graph of the function 


s 

2a(x — a) +a~ 
2» 3 

= 2ax —a’. 


g(x) 


Now, if the graphs of f and g intersect at a point (x. f(x)) = @, g(x)), then 


x? = 2ax — a’ 
or x?—2ax+a*=0: 
so (x — @)2 =) 
ees — 
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The function f(«) = x- happens to be quite special in this regard; usually a 
tangent line will intersect the graph more than once. Consider, for example, the 
function f(x) = x?. In this case 


{_(@) = im Ja + = {1a) +) = £@) 


h—-0 h 
_ @+thy—a 
Selina —o_| é  — 
h—>0 h 
— a+3a2h + 3ah? +43 -— 23 
= hn 
h-+0 h 
— 3a2h +3ah? +43 
= hm ———————__——_ 
h—-0 h 
— ote 3a? + 3ah 4h 
h— 
iat 


Thus the tangent line to the graph of f at (a, a?) has slope 3a*. This means that 
the tangent line 1s the graph of 


oxi 3a2(x —ata@ 
= 3q*%— 2a” 


The graphs of f and g intersect at the point (x, f(x)) = (x. g(x)) when 


x? =3a*x —2@° 
3) 
OF ee sBaateo 


This equation ts easily solved if we remember that one solution of the equation 
has got to be x = a, so that (x — a) 1s a factor of the left side; the other factor can 
then be found by dividing. We obtain 


(x — a)(x- + ax — 2a’) = (). 


6) 3) . ~ 
It so happens that x* + ax — 2a~ also has x — a as a factor; we obtain finally 


(x — a)(x — a)(x + 2a) = 0. 


9) 
slope 3a“ 


Thus, as illustrated in Figure 10, the tangent line through (a, a?) also intersects 
—2a a the graph at the pot (—2a, —8a3). These two points are always distinct, except 
= when a = 0. 

We have already found the derivative of sufficiently many functions to illustrate 
the classical, and sull very popular, notation for derivatives. For a given function f, 
the derivative f’ is often denoted by 


df (x) 


dx 


mOWRE Lo lor example, the symbol 


ax* 


dx 


— 
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. . . o) 
denotes the derivative of the function f(x) = x-. Needless to say, the separate 
parts of the expression 


df (x) 


dx 


are not supposed to have any sort of independent existence — the d’s are not num- 
bers, they cannot be canceled, and the entire expression is not the quotient of two 
other numbers “df(x)” and “dx.” This notation is due to Leibniz (generally 
considered an independent co-discoverer of calculus, along with Newton), and is 
affectionately referred to as Leibnizian notation.* Although the notation df (+)/dx 
seems very complicated, in concrete cases it may be shorter; after all, the symbol 
dx*/dx is actually more concise than the phrase “the derivative of the function 
SF Oe) iesae a 

The following formulas state in standard Leibnizian notation all the information 
that we have found so far: 


dc 
—— Q, 
dx 
d(ax + b) 
—————_ =a, 
dx 
g 
ee 
dx 
dx? = ay 
he 


Although the meaning of these formulas is clear enough, attempts at literal 
interpretation are hindered by the reasonable stricture that an cquation should 
not contain a function on one side and a number on the other. For example, if 
the third equation is to be true, then either df(x)/dx must denote f’(«), rather 
than f’, or else 2x must denote, not a number, but the function whose value at x 
is 2x. It is really impossible to assert that one or the other of these alternatives is 
intended; in practice df(x)/dx sometimes means f’ and sometimes means f’(x), 
while 2x may denote either a number or a function. Because of this ambiguity, 
most authors are reluctant to denote f’(a) by 


df (x) 


dx 


(a); 


instead f’(a) is usually denoted by the barbaric, but unambiguous, symbol 


df (x) 


dx 


X>=a 


* Leibniz was led to this symbol by his mtuitive notion of the derivative, which he considered to be, 
not the hmuit of quotients [f(x +h) — f(x)|/A, but the “value” of this quotient when /7 is an “infiniteh 
small” number. ‘Vhis “infimtely small” quantity was denoted by dx and the corresponding “infinitely 
small” difference f(v+dx)— f(x) by df (x). Although this point of view is impossible to reconcile with 
properties (P1)-(P13) of the real numbers, some people find this notion of the derivative congenial. 
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In addition to these difficulties, Leibnizian notation is associated with one more 
ambiguity. Although the notation dx*/dx is absolutely standard, the notation 
df (x)/dx is often replaced by df/dx. This, of course, is in conformity with the 
practice of confusing a function with its value at x. So strong is this tendency that 
functions are often indicated by a phrase like the following: “consider the function 


2» 


None We will sometimes follow classical practice to the extent of using y 
as the name of a function, but we will nevertheless carefully distinguish between 
the function and its values~—thus we will always say something like “consider the 


function (defined by) y(x) = ee 


Despite the many ambiguities of Leibnizian notation, it is used almost exclu- 
sively in older mathematical writing, and is still used very frequently today. The 
staunchest opponents of Leibnizian notation admit that it will be around for quite 
some time, while its most ardent admirers would say that it will be around for- 
ever, and a good thing too! In any case, Leibnizian notation cannot be ignored 
completely. 


The policy adopted in this book is to disallow Leibnizian notation within the 
text, but to include it in the Problems; several chapters contain a few (immediately 
recognizable) problems which are expressly designed to illustrate the vagaries of 
Leibnizian notation. ‘Trusting that these problems will provide ample practice in 
this notation, we return to our basic task of examining some simple examples of 
derivatives. 

The few functions examined so far have all been differentiable. ‘To fully ap- 
preciate the significance of the derivative it is equally important to know some 
examples of functions which are not differentiable. ‘The obvious candidates are the 
three functions first discussed in this chapter, and illustrated in Figure 1; if they 
turn out to be differentiable at 0 something has clearly gone wrong. 


Consider first f(x) = |x|. In this case 


fO+h)— fO) _ {hl 


h he 


Now |h|/h = 1 for h > 0, and |h|/h = —1 for h < 0. This shows that 


i 1) 0) 
1 <=... 


h—0O h 


does not exist. 


lint fact 


fh) — FO) — 


lim = | 
h>0+ h 
. f@=fO) 
and lin. ————_——— = —]. 
h—0 h 


(These two limits are sometimes called the right-hand derivative and the left- 
hand derivative, respectively, of f at 0.) 
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If a 40, then f’(a) does exist. In fact, 


F@)=1 if x > 0, 
f'@®)=-1 ifx <0. 


The proof of this fact is left to you (it is easy if you remember the derivative of a 


linear function). The graphs of f and of f’ are shown in Figure 11. 
gray] 3 i g 


bor the function 
XS: e=8 
ix) = = 


Xx. PSV. 


f'(O). We have 


a similar difficulty arises in connection with 


h? 
FIGURE 11 f(h) — f(O) 5 =, hh =D 
h ia 
as =A ls h > 0. 
h 
uy Therefore, 
‘(h) — f(O 
lim J Ah) ~ J) £@) = (0, 
h>0- h 
. fthy- ~O) 
but lim. —————_- = 1. 
h—>0? h 


Thus f'(0) does not exist; f is not differentiable at 0. Once again, however, f’(x) 
exists for x 4 Q—it is easy to see that 
re 
2x, x = 0 


FAB) = l, Suk 


The graphs of f and f’ are shown in Figure 12. 
Even worse things happen for f(x) = y |x|. For this function 
ae. 
‘ —<—= ; h>0O 
fih)—fO | 4 wh 
h Pee | 
h <0. 


FIGURE 12 


In this case the right-hand limit 


~~. Fe) =F) | 
lim ———— = lim — 
f(x) = y|x| h—0° h h+0+ /h 
does not exist; instead 1/h becomes arbitrarily large as h approaches 0. And, 


what's more, —1/—h becomes arbitrarily large in absolute value, but negative 


(Figure 13). 


FIGURE 13 
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f(x) =-Vx f(x) = Vx 


FIGURE 14 


THEOREM 1 


PROOF 


The function f(x) = Vx, although not differentiable at 0, is at least a little 
better behaved than this. ‘The quotient 


fcr) = FO) _ Va _ WI ! 


h -a ee eo ine 


simply becomes arbitrarily large as h goes to 0. Sometimes one says that f has 
an “infinite” derivative at 0. Geometrically this means that the graph of f has a 
“tangent hne” which is parallel to the vertical axis (Figure 14). Of course, f(x) = 


— Vx has the same geometric property, but one would say that f has a derivative 
of “negative infinity” at 0. 

Remember that differentiability is supposed to be an improvement over mere 
continuity. ‘This idea is supported by the many examples of functions which are 
continuous, but not differentiable; however, one important point remains to be 
noted: 


If f is differentiable at a, then f is continuous at a. 


hm f(a +h) — f(a) = hm AC) h 


h—-O0 h 
ey (Geel) eet Capea a 
= lim —————_.- lm h 
h-0 h h->0 
= ay) 
=. 


As we pointed out in Chapter 5, the equation lim f(a+h)— f(a) = 0 is equivalent 
h—> 


to lim f(x) = f(a); thus f is continuous at a. J 
pe — at dl 


It is very important to remember Theorem 1, and just as important to remember 
that the converse is not true. A differentiable function is continuous, but a con- 
tinuous function need not be diflerentiable (keep m mind the function f(x) = |x], 
and you will never forget which statement is true and which false). 

The continuous functions examined so far have been differentiable at all pomts 
with at most one exception, but it 1s easy to give examples of continuous functions 
which are not differentiable at several pomts, even an infinite number (Figure 15). 
Actually, one can do much worse than this. There is a function which ts continuous 


TAR IS 


9. Derivatives 159 


VV iV" 


(b) 


(c) * (d) 
FIGURE 16 


everywhere and differentiable nowhere! Unfortunately, the definition of this function will 
be inaccessible to us until Chapter 24, and I have been unable to persuade the 
artist to draw it (consider carefully what the graph should look like and you will 
sympathize with her point of view). It is possible to draw some rough approxima- 
tions to the graph, however; several successively better approximations are shown 
in Figure 16. 

Although such spectacular examples of nondifferentiability must be postponed, 
we can, with a little ingenuity, find a continuous function which is not differentiable 
at infinitely many points, a// of which are in [O, 1]. One such function is illustrated in 
Figure 17. The reader is given the problem of defining it precisely; it is a straight 
line version of the function 


an 
FOS es, 3 # () 


0, x= Q; 


This particular function f is itself quite sensitive to the question of differentiability. 
FIGURE 17 Indeed, for h 4 0 we have 
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al 
fh) — f() 7 ne 0 aie 1 
—___ = ——*—_ =sin - 


h h he 


Long ago we proved that lim sin 1// does not exist, so f is not differentiable at 0. 
} h—0O 


Geometrically, one can see that a tangent line cannot exist, by noting that the 
secant line through (0,0) and (A, f(/)) in Figure 18 can have any slope between 
—1 and 1, no matter how small we require h to be. 


l 
fixy=d* sin-, x #0 
JN =a) x 
0, oo 4)) 


FIGURE 18 


This finding represents something of a triumph; although continuous, the func- 
tion f seems somehow quite unreasonable, and we can now enunciate one math- 
ematically undesirable feature of this function—it is not differentiable at 0. Nev- 
ertheless, one should not become too enthusiastic about the criterion of differen- 
tiability. For example, the function 


sieell 
x-sin-—, x #0 
Xx 
Q), ~~. 


g(x) = 


is differentiable at 0; in fact g’(0) = 0: 


pie Ul 
_ g(h) — g(0) . h- sin E 
y= lim 
hee h ho0 A 
= lim Asin — 
h—O h 


=i 


The tangent line to the graph of g at (0,0) is therefore the horizontal axis (Fig- 
ure 19). 

This example suggests that we should seek even more restrictive conditions on a 
function than mere differentiability. We can actually use the derivative to formulate 
such conditions if we introduce another set of definitions, the last of this chapter. 
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PKC TERE, ES) 


For any function f, we obtain, by taking the derivative, a new function f” (whose 
domain may be considerably smaller than that of f). The notion of differentia- 
bility can be applied to the function f’, of course, yielding another function (f7)’, 
whose domain consists of all pomts a such that f’ is differentiable at a. ‘The func- 
tion (f’)' is usually written simply f” and is called the second derivative of f. 
If f’(a) exists, then f is said to be 2-times differentiable at a, and the number 
f(a) is called the second derivative of f at a. 

In physics the second derivative is particularly important. If s(t) 1s the posi- 
tion at time ¢ of a particle moving along a straight line, then s”(t) is called the 
acceleration at time ¢. Acceleration plays a special role in physics, because, as 
stated in Newton’s laws of motion, the force on a particle is the product of its mass 
and its acceleration. Consequently you can feel the second derivative when you 
sit in an accelerating car. 

There is no reason to stop at the second derivative we can define f”” = (f")’, 
f= (f'"Y, ete. This notation rapidly becomes unwieldy, so the followmg abbre- 
viation is usually adopted (it 1s really a recursive definition): 


fe = ae 
pa” = Gee. 
Thus 
fO= fla ry, 
fO=flayr'y, 
f® = ee = Go 


etc. 


ee 
a 
Ss 
| 
= 


The various functions f“, for k > 2, are sometimes called higher-order 
derivatives of f. 

Usually, we resort to the notation f only for k > 4, but it is convenient to 
have f™ defined for smaller k also. In fact, a reasonable definition can be made 
vas 


for , hamely, 


ee 
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Leibnizian notation for higher-order derivatives should also be mentioned. ‘The 
natural Leibnizian symbol for f(x), namely, 


d (=) 
dx 


dx 


F(x) =x? is abbreviated to 
d* f(x) 
(dx)? ° 


d* f(x) 


or more frequently to A 
xe 


Similar notation is used for f(x). 
The following example illustrates the notation f“’, and also shows, in one very 
(a) simple case, how various higher-order derivatives are related to the original func- 
tion. Let f(x) =x. Then, as we have already checked, 


fi eS 2x 
Daksa 
f(x) =HiN 


ff") =0, ifk > 3. 


Figure 20 shows the function f, together with its various derivatives. 
A rather more illuminating example is presented by the following function, 


(b) whose graph is shown in Figure 21 (a): 
? - 
f(x) = ; > : = 2 
f(x) =2 ae 
[ea a It is easy to see that 
f'@)= 2a “ifa>'O; 
= f'(a)=-—2a_ ifa <0. 
Moreover, 
(c) > (h) — f(O) 
*'(0) = lim f f 
h—0 h 
= tie) 
= lim : 
h>0 h 
Now 
Mix) =O, k > 3 : 
poe) = a ee 
———— lim = lim — =0 
r>0+ A h0+ 
. f) = =e 
jis and lim AY = hm —=0 
(d) ho0- h hs0- A 
FIGURE 20 sO ‘ns 
; 1) 
f (0) = lim a = () 
h-0 A 


This information can all be summarized as follows: 


Fixe) = 2x 


on x?, xr >0 
f(x) = ony) <0 
(a) 
fi) = 2\x| 
(b) 
, Oa 2 
_———— 
TG =—2, «<0 
(c) 


FIGURE 21 
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It follows that f”(O) does not exist! Existence of the second derivative is thus a 
rather strong criterion for a function to satisfy. Even a “smooth looking” function 
like f reveals some irregularity when examined with the second derivative. ‘This 


suggests that the irregular behavior of the function 


i ml 
x* sin =, Boesa0 
x 
0, P= 


might also be revealed by the second derivative. At the moment we know that 
g(O) = 0, but we do not know g’(a) for any a # 0, so it is hopeless to begin 
computing g”(O). We will return to this question at the end of the next chapter, 
after we have perfected the technique of finding derivatives. 


g&) = 


PROBLEMS 


1. (a) Prove, working directly from the definition, that if f(x) = 1/x, then 
f'(a) = —1/a’, fora £0. 
(b) Prove that the tangent line to the graph of f at (a, 1/a) does not intersect 
the graph of f, except at (a, 1/a). 


2. (a) Prove that if f(x) = 1/x?, then f’(a) = —2/a? for a £0. 
(b) Prove that the tangent line to f at (a, 1/a7) intersects f at one other 
point, which lies on the opposite side of the vertical axis. 


3. Prove that if f() = /x. then f’(a) = 1/(2/a), for a > 0. (The expression 
you obtain for [f(a +h) — f(a)|/h will require some algebraic face lifting, 
but the answer should suggest the right trick.) 


4, For each natural number n, let S,(x) = x”. Remembering that S;’(x) = I, 
So/(x) = 2x, and $3/(x) = 3x, conjecture a formula for S,'(x). Prove your 
conjecture. (The expression (x + hf)” may be expanded by the bmomial 
theorem.) 


5. _ Haney) at (o.) =i [ax |. 
6. Prove, starting from the definition (and drawing a picture to illustrate): 


(ali) — 7 (x) --c."then &’ (x)= (©: 
(Deer yi — cf (mathe s(x) = cf"). 


7 eesippose (iat fix) — x. 


Gye what is //O), f 25)N/’'C6)? 

(b) What is f’(37), f’(5*), £’(67)? 

(c) What is f’(a’), f’(x7)? 

If you do not find this problem silly, you are missing a very mportant point: 
TOs) means the derivative of f at the number which we happen to be 
calling x7; it is not the derivative at x of the function g(x) = f(x’). Just to 
drive the point honic: 


(d) For f(x) =x, compare f'(x7) and g(x) where g(x) = f(x”). 
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10. 


ik 


2. 


(a) Suppose g(x) = f(v+c). Prove (starting from the definition) that g’(x) = 
f'(« +e). Draw a picture to illustrate this. To do this problem you must 
write out the defmitions of g’(x) and f’(x +) correctly. ‘The purpose 
of Problem 7 was to convince you that although this problem is easy, it 1s 
not an utter triviality, and there is something to prove: you canot simply 
put prime marks into the equation g(x) = f(x +c). ‘To emphasize this 
point: 

(by Prove that if ¢(x) = flex), thene G@) =ic-47 (ea). Dey to see pictorially 
why this should be true, also. 

(c) Suppose that f is differentiable and periodic, with period a (ie., 
f(e+a) = ff) for ail x). Prove that f’ is also periodic. 


Find f’(x) and also f’(x + 3) in the following cases. Be very methodical, 
or you will surely slp up somewhere. Consult the answers (after you do the 
problem, naturally). 


i me a Oe 3)2. 

(eo) — 2 

(ii) fe +3)=(+5)’. 

Find f’(x) if f(x) = g(t +x), and if f(t) = g(t +x). The answers will no/ 
be the same. 


(a) Prove that Galileo was wrong: ifa body falls a distance s(t) in f seconds, 
and s’ is proportional to s, then s cannot be a function of the form 
Sete. 

(b) Prove that the following facts are true about s if s(t) = (a/2)t? (the first 
fact will show why we switched from ¢ to a/2): 


(i) s”(t) =a (the acceleration is constant). 
(ii) © [s'(t)]? = 2as(n). 


(c) If s is measured in feet, the value of a is 32. How many seconds do you 
have to get out of the way of a chandeher which falls from a 400-foot 
ceilmg? If you don't make it, how fast will the chandelier be gomg when 
it hits you? Where was the chandeher when it was moving with half that 
speed? 


hnagine a road on which the speed lmnit is specified at every smgle poimt. In 
other words, there is a certain function L such that the speed mit x mules 
from the beginning of the road is L(x). ‘Two cars, A and B, are driving along 
this road; car A’s position at time f is a(t), and car B’s is b(t). 


(a) What equation expresses the fact that car A always travels at the speed 
limit? (The answer is nol a’(t) = L(t).) 

(b) Suppose that A always goes at the speed limit, and that B’s position at 
time f is A’s position at time t—1. Show that B is also going at the speed 
limit at all times. 

(c) Suppose, mstead, that B always stays a constant distance behind A. Un- 
der what conditions will B sull always travel at the speed hmit? 


4 


13. 


14. 


15. 


16. 
We 


*18. 


1: 


20. 


PGs REE 22 
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Suppose that f(a) = g(a) and that the left-hand derivative of f at a equals 
the right-hand derivative of g at a. Define h(x) = f(x) for x < a, and 
h(x) = g(x) for x > a. Prove that / is differentiable at a. 


Lege = x? if x is rational, and f(x) = 0 if x is irrational. Prove that 
f is diflerentiable at 0. (Don’t be scared by this function. Just write out the 


definition of f’(0).) 


(a) Let f be a function such at |f(x)| < x* for all x. Prove that f is 
diflerentiable at 0. (If you have done Problem 14 you should be able to 
do this.) 

(b) This result can be generalized if x7 is replaced by |g(x)|, where g has 
what property? 


Leta > 1. If f satisfies | f(x)| < |x|“, prove that f is differentiable at 0. 


Lev @ < Bi) Prove thatif f satisfies | f(x) > |x|? and (@) = Onticen 7 
is not diflerentiable at 0. 


Let f(x) = 0 for irrational x, and 1/qg for x = p/q in lowest terms. Prove 
that f is not differentiable at a for any a. Hint: It obviously suflices to prove 
this for irrational a. Why? If a@ = m.ajaza3... 1s the decimal expansion 
of a, consider [ f(a +h) — f(a)]/h for h rational, and also for 


Pi —0.00 .. . Ody pdng2 -- 0 


(a) "Suppose*tltaiey (aja we (a)"— h(a), that) f(x) = g(x) = he) siorall, 
and that f’(a) = h’(a). Prove that’g is differentiable at a, and that 
f(a) = g(a) = h(a). (Begm with the definition of g’(a).) 

(b) Show that the conclusion does not follow if we omit the hypothesis 


F (ah= wile) — AG): 


Let f be any polynomial function; we will see in the next chapter that f 
is diflerentiable. The tangent hne to f at (a, f(a)) is the graph of g(x) = 
f'(a)(x — a) + f(a). Thus f(x) — g(x) is the polynomial function d(x) = 
f(x) — f(a — a) — f(a). We have already seen that if f(x) = x?, then 
d(x) =(x - a). angett 7 (¢).= x?, then d(x) = (x — Deeeas 2a). 


(a) Find. d@jwhen f(x) = xt, and show that it is divisible by (x — De. 

(b) ‘There certainly seems to be some evidence that d(x) is always divisible by 
(x —a)*. Figure 22 provides an ituitive argument: usually, lines parallel 
to the tangent line will intersect the graph at two points; the tangent Ime 
intersects the graph only once near the point, so the intersection should 
be a “double intersection.” ‘To give a rigorous proof, first note that 


d(x) f(x) = fla) 


xX —& Bay C 


i (a 


Now answer the followmg questions. Why is f(x) — f(a) divisible 
by (x — a)? Why is there a polynomial function fr such that A(x) = 
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pages 


Zoe 


24. 
2D. 


26. 


rails 


28. 


d(xa)/(x — a): foros Ala? Why iselimdi(4) = 0? Why is ita) = 0? Why 


does this solve the problem? 


(a) Show that f’(a) = lim[f(@) — f(a)]/(« — a). (Nothing deep here.) 
Xa ; 

(b) Show that derivatives are a “local property”: if f(x) = g(x) for all x in 
some open interval containing a, then f’(a) = g’(a). (This means that 
in computing f’(a), you can ignore f(x) for any particular x 4 a. Of 
course you can't ignore f(x) for all such x at once!) 

(a) Suppose that f is differentiable at x. Prove that 

. fix th)- fx-h) 
/ 
(x) = hm ———————__.. 
f h->0 2h 
Hint: Remember an old algebraic trick —a number is not changed if the 
same quantity is added to and then subtracted from it. 
**(b) Prove, more generally, that 
, Or sel) SC 15) 
¢ Corbin eA aot oer GS 
h,k>0* h ar k 
Although we haven't encountered something like lm_ before, its mean- 
; - h.ka> 
ing should be clear, and you should be able to make an appropriate e-6 
definition. ‘The important thing here is that we actually have lim, , SO 
: h,k> OF 
that we are only considering positive ft and k. 

Prove that if f is even, then f’(x) = —f’(—x). (In order to minimize con- 

fusion, let g(x) = f(—x); find g’(x) and then remember what other thing g 

is.) Draw a picture! 

Prove that if f is odd, then f’(x) = f’(—x). Once again, draw a picture. 

Problems 23 and 24 say that f’ is even if f is odd, and odd if f is even. 

What can therefore be said about f? 

Kinde, “(s) at 

OM ees 

ao) = ee 

(iit) f(x) = x". 

(iv) f@+3) =x. 

If S,@c) = x", and 0< k <7, prove that 

n! 
iC (x) =e tk 
‘ (n —k)! 
= n xk 
Kye 
(a) “hind ™ Qe) Lei = iv(?. Find f’(x). Does f’’(x) exist for all x? 
(b) Analyze f sumilarly if f(x) = x" fora = 0 aide &) = =x 8Mor x = 0! 
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29. Let f(x) =x" for x => 0 and let f(x) = 0 for x < 0. Prove that f° exists 
(and find a formula for it), but that f'”’(O) does not exist. 


30. Interpret the followmg specimens of Leibnizian notation; each is a restate- 
ment of some fact occurring in a previous problem. 


ifs n 
(i) — =nx"! 
dz ] 1 
is au Pe eee 
tu) dy y? y 
wa Ol f)-Fel dfeo 
(111) = 
dx dx 
nu, Alef e)] df (x) 
(iv) ———— = c— 
dx dx 
dz dy 
(v) ein AS if Z=yt+e. 
dx 5 
1) — = 3at, 
(vi) de a 
., df(x+a) df (x) 
(vu) ———— = —— , 
dx x=b dx x=b+a 
ey Sr See 
dx r=b dx x=ch 
ayy GES) df(y) 
x =c+ — 
. dx dy |yae 


d* x" n 
x =. ar 
) dxt (;) : 


CHAPTER 


THEOREM 1 


PROOF 


THEOREM 2 


PROOF 


DIFFERENTIATION 


The process of finding the derivative of a function 1s called differentiation. From the 
previous chapter you may have the impression that this process 1s usually laborious, 
requires recourse to the definition of the derivative, and depends upon successfully 
recognizing some limit. It is true that such a procedure is often the only possible 
approach—if you forget the definition of the derivative you are likely to be lost. 
Nevertheless, in this chapter we will learn to differentiate a large number of func- 
tions, without the necessity of even recalling the definition. A few theorems will 
provide a mechanical process for differentiating a large class of functions, which 
are formed from a few simple functions by the process of addition, multiplication, 
division, and composition. ‘This description should suggest what theorems will be 
proved. We will first find the derivative of a few simple functions, and then prove 
theorems about the sum, products, quotients, and compositions of differentiable 
functions. The first theorem is merely a formal recognition of a computation 
carried out in the previous chapter. 


If f is a constant function, f(x) = c, then 


f(a) =0 for all numbers a. 


ye a 
1 at) BASS 01/9) St) ay eee 
h—0 h ho>0 hh 


=0., 


The second theorem is also a special case of a computation in the last chapter. 


If f is the identity function, f(«) = x, then 


f(a) =1- for all numbers a. 


, @) = imi Jatrw = 1@ T= Sw 
: h—0 


h 
| ath—a 
= jm —————_ 
h=0 h 
oh 
=lm—= 1. 
noo h 


The derivative of the sum of two functions is just what one would hope— the 
sum of the derivatives. 
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THEOREM 3 If f and g are differentiable at a, then f + g is also differentiable at a, and 
(f +8)'(a) = f'(a) + 9'(a). 


(f+age)(a+h)—-(f+g)(a) 


PROOF (f +8)(a) = lim l 
— 1 
— jim LEY tae +) — LO + 8@)] 
_ h—0 h 
; [a +h)—f(@)  glatkh)— ad 
= i eS a Se 
h—O h h 
. flat+h)— fla) . gla+h) — g(a) 
= hm A. +-_- Sr ms A 
h>0 h h—>0 h 


= f'(a)+e'(a).J 


The formula for the derivative of a product is not as simple as one might wish, 
but it is nevertheless pleasantly symmetric, and the proof requires only a simple 
algebraic trick, which we have found useful before—a number is not changed if 
the same quantity is added to and subtracted from it. 


THEOREM 4 If f and g are differentiable at a, then f - g is also differentiable at a, and 
(f -g) (a) = f'(a)- g(a) + fla) - g(a). 


PROOF (f - g)'(a) = lim eee “df 810) 


h 
._ flath)glat+h)— fla)g(a) 
St ee et 
a0 h 
- fy PR ie 
h=> ] F 


x hy — o A) — 
=n eer lim 2 a8) |, LO 
h-0 ha) h ana h 


= f(a)-g'(a)t+ f(a): g(a). 


(Notice that we have used Theorem 9-1! to conclude that him flath) = f(a).) 
ho 


- lim g(a) 
ho>0- 


In one special case Theorem 4 simplifies considerably: 


THEOREM 5 If g(x) =cf(x) and f is differentiable at a, then g is differentiable at a, and 


9 (a)=a- f @. 


PROOF If A(x) =c, so that g =h- f, then by Theorem 4, 


g(a) = th: 7) (a) 
=h(a)- f(a) +h'(a)- fla) 
=c-+ f'(a)+0- fla) 
= ¢- jaa). 
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THEOREM 6 


PROOF 


Notice, mn particular, that (—f)/(a) = — f’(a), and consequently (f — g)/(a) = 
(f tale) @) — f(a) — 2 (a). 

To demonstrate what we have already achieved, we will compute the derivative 
of some more special functions. 


If f(x) =x" for some natural number 7, then 


f(a) =na""! for all a. 


The proof will be by induction on n. For n = I this is simply Theorem 2. Now 
assuine that the theorem is true for 1, so that if f(x) = x”, then 


| 


f@ =n for all a. 


n+] Hts 


ete) = x'tl If I(x) =x, the equation x x" -x can be written 


g(x) = f OD ahe) for all’x: 
thus g = f- TJ. It follows from Theorem 4 that 
fila) Ta) + fla): I'(a) 
=na"!.ata"-1 


= na" +a" 
=(n+ 1a", for all a. 


pe (a) =(f- 1) @) 


This is precisely the case n + 1 which we wished to prove. J 


Putting together the theorems proved so far we can now find f’ for f of the 
form 


aoe 2, 
ROY GX + Gn 1X ep =: Ge Gane eo. 
We obtain 
i 5 5) 
fy (oa, x Lntp— la, xs era ae 


We can also find f”: 


3 


fo =n Wak Ok ee ie eee 


This process can be continued easily. Each differentiation reduces the highest 
power of x by 1, and eliminates one more a;. It is a good idea to work out the 
derivatives f’”", f, and perhaps f©, until the pattern becomes quite clear. The 
last interesting derivative is 
f'"@) = nian: 
for k > n we have 
ec) =) 

Clearly, the next step in our program is to find the derivative ofa quotient f/g. 
It is quite a bit simpler, and, because of ‘Theorem 4, obviously sufficient to find 
the derivative of I/g. 
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THEOREM 7 If g is differentiable at a, and g(a) 4 0, then 1/g is differentiable at a, and 
1 \e —v'(a) 
—]| (a) = ——.. 
(;) [s@P 


PROOF Before we even write 


(~) (ath) — @ (a) 
8g is 
h 


we must be sure that this expression makes sense—it 1s necessary to check that 
(1/g)(a+h) is defined for sufficiently small 4. This requires only two observations. 
Since g 1s, by hypothesis, differentiable at a, it follows from Theorem 9-1 that g is 
continuous at ad. Since g(a) 4 O, it follows from ‘Theorem 6-3 that there 1s some 
6 > Osuch that g(a +h) 4 0 for |h| < 6. Therefore (1/g)(a +h) does make sense 
for small enough /, and we can write 


1 | | \ 
—}](a+h)—{-—] (a) ae 
lim -22-— V8 7 slath) gta) 
= h ew j 


g(a) — g(a +h) 


= hm Sa —_ 
h-0 h[g(a)- g(a +h)] 
_ —-[g(at+h)—- g(a)] | 
= —  —————— ee. 2s 
h>0 h g(a)gla +h) 
ee CS Se I 
= lm SH .. lm, MH 
h—0 h h>0 g(a): g(a th) 
= —g'(a)- —— >. 
- car 


(Notice that we have used continuity of g at a once again.) § 


The general formula for the derivative of a quotient is now easy to derive. 
Though not particularly appealing, it is important, and must simply be memo- 
rized (I always use the incantation: “bottom times derivative of top, minus top 
times derivative of bottom, over bottom squared.” ) 


THEOREM 8 If f and g are differentiable at a and g(a) 0, then f/g is differentiable at a, 
and 
(4) g(a): f(a) — fla)- g'(a) 
(a) = ——__—_______ >. 


2 [g(a)|? 
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Since f/g = f - (1/g) we have 


_ Ul | / 
(“) (a) (s-=) (a) 
g g& 
n | iP te 
H | @-(~)@+ sa, (-) (a) 
g g& 


fila) fla\(—g'(a)) 


g(a) [g(a)]* 
_ f'(a) g(a) — f(a)- g'(a) i 
[s(a)]? 


We can now differentiate a few more functions. For example, 


x7] (x* + 1)(2x) — (x? = 1)Qx) 4y 
if f(x) = ——., the Ag Ey a ea a 
Aap Oe? Pel erp (x? +1)? 
it x 4 (x- + 1) — x(2x) 1 —x? 
f f(x) =——, the x) = ————__—__ = —— : 
uF) 12S Fe) (x2 + 1)? (x2 + 1)? 

l l 
if f(x)=-, then f’(x) = —— = (—I)x~?. 

x x* 


Nouce that the last example can be generalized: if 


f(x) =x "= —, — for some natural number n, 
x 
then 
n—1 
—nx oe 
f(x) = —— = (-n)x" I. 
xen 


thus Theorem 6 actually holds both for positive and negative integers. If we inter- 
pret f(x) =x" to mean f(x) = 1, and f(x) =0- x! to mean f’(x) = 0, then 
Theorem 6 is true for n = 0 also. (The word “interpret” is necessary because it is 
not clear how 0° should be defined and, in any case, 0 - 0~! is meaningless.) 
Further progress in diflerentiauion requires the knowledge of the derivatives of 
certain special functions to be studied later. One of these is the sine function. For 
the moment we shall divulge, and use, the followmg information, without proof: 


sin (a) = cosa for all a, 
cos (a) = —sina for all a, 
This information allows us to differentiate many other functions. For example, if 
f(x) =xsm¥, 
then 
f'(x) =x cosx +sin x, 


f(x) = —x sin x + cosx + cosx 
= —x sin x +2 cos x; 


10. Differentiation 173 


if 
; -% ; : 
oe) SsSlpeee— SI eo 1, 
then 
g(x) = sin.x cosx + cos x sin x 
ees J, 
g(x) = 2[(sin.x)(— sin x) + cos x cos x] 
2 io) 
=, 2\\cassae sift x |- 
if 
2 
BCS) = Gos te—.COs XV COSL, 
then 


h'(x) = (cos x)(— sin.x) + (— sin x) cos x 
= —2sin x cos Xx, 
h" (x) = —2[cos” x — sin? x]. 
Notice that 
g(x) +h’) =9, 


hardly surprising, since (g + h)(x) = sin? x + cos? x = 1. As we would expect, we 
also have g”(x) +h”"(x) = 0. 

‘The examples above involved only products of two functions. A function involy- 
ing triple products can be handled by ‘Theorem 4 also; in fact it can be handled 


in two ways. Remember that f - g-/ is an abbreviation for 


Ce Be RE 
Choosing the first of these, for example, we have 


(f-g-hyY@)=(f- 8) @)- h@) t+ Ff - g)x) h(x) 
=[f' (vg) + fix)g(WJh(x) + fxg Q@h'(x) 
= f'(x)g(x)h(x) + f(x)g’(x)h(x) + fx)g(x)h'(). 


The choice of f - (g +) would, of course, have given the same result, with a 
different termediate step. ‘The final answer is completely symmetric and easily 
remembered: 


(f -g-h)’ is the sum of the three terms obtained by diflerentiating each of f, 
g, and ft and multiplying by the other two. 


For example, if 
= x? sim x cos x, 
then 
a 3x7 sinx cosx +x? cos x cosx + x3(sin x)(—sinx). 
Products of more than 3 functions can be handled similarly. For example, you 
should have little difheulty deriving the formula 
(f-g-h- ky (x)= f')g@)hyk(x) + f@)g' WAG )K(X) 
+ fix)g(x)h'(x)k(x) + fix)g(x)h(xyk (x). 
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You might even try to prove (by induction) the general formula: 


n 
Hien fa) C—O gee cw Ce) fits) 0. = 
= 
Differentiating the most interesting functions obviously requires a formula for 
(f og)'(x) in terms of f’ and g’. To ensure that f og be differentiable at a, one 
reasonable hypothesis would seem to be that g be differentiable at a. Since the 
behavior of fo g near a depends on the behavior of f near g(a) (not near a), it 
also seems reasonable to assume that f 1s diflerentiable at g(a). Indeed we shall 
prove that if g is diflerentiable at a and f is differentiable at g(a), then f o g is 
differentiable at a, and 


(fog) (a) = f'(gla))- g(a). 


This extremely important formula is called the Cham Rule, presumable because 
a composition of functions might be called a “chain” of functions. Notice that 
(f og)’ is practically the product of f’ and g’, but not quite: f’ must be evaluated 
at g(a) and g’ at a. Before attempting to prove this theorem we will try a few 
applications. Suppose 


Loon sin x. 
Let us, temporarily, use 5 to denote the (“squaring”) function S(x) = x7. ‘Then 
7 = sien, 
‘Therefore we have 


f'(x) = sin’(S(x)) - S'(x) 


= cogs - 2. 
Quite a diflerent result is obtained if 
Bt tS a 
Siy=sm «= 


In this case 


ye Sho Shy 


so 


/ 


f'(x) = S(sinx)- sin’) 


= 2sinx-cosXx. 


Notice that this agrees (as it should) with the result obtamed by writmg f = sin- sin 
and using the product formula. 


6 


Although we have invented a special symbol, S$, to name the “squaring” function, 
it does not take much practice to do problems like this without bothering to write 
down special symbols for functions, and without even bothering to write down the 
particular composition which f is--one soon becomes accustomed to taking f 
apart in one’s head. ‘Vhe following differentiations may be used as practice for 
such mental gymnastics if you find it necessary to work a few out on paper, by 
all means do so, but try to develop the knack of writing f’ immediately afier seemeg 
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the definition of f; problems of this sort are so simple that, if you just remember 
the Chain Rule, there is no thought necessary. 


i f (cv) = sim x? then f’(x) = cosx? + 3x? 

ro = sin? x 7; Gor sin? x - cos x 

ay P. l —] 
eC = sii — 7 10) es a 

x ov wwe 

7) = sins w) f(@)e= eosin) - GOs x 
1) = sin(x? + 3x) 1 cos(x> + 3x7) - (3x7 + 6x) 
(OHO 231g A@.253 VC -o2): 


A function hke 


ae: w 5 my) 
ic —sinigee ale nae 
which is the composition of three functions, 
if aio sino Si. 


can also be differentiated by the Chain Rule. It is only necessary to remember 
that a triple composition f o goh means (f og)oh or f o(g oh). Thus if 


5 i) 
LQ) ]siie x 
we can write 


f =(G6 osm) cy 
7? = S22. Gino S) 


The derivative of either expression can be found by applying the Chain Rule 
twice; the only doubtful point is whether the two expressions lead to equally simple 
calculations. As a matter of fact, as any experienced differentiator knows, it is much 
better to use the second: 


j= on oS). 


We can now write down f’(x) in one fell swoop. ‘To begin with, note that the first 
function to be differentiated is S, so the formula for f’(+) begins 


ey — 2 ) 


. . 9 x . . 
Inside the parentheses we must put sin.xv*, the value at x of the second function, 
smo S. Thus we begin by writing 


a . 33 
Jie lem Sie 


(the parentheses weren't really necessary, after all). We must now multiply this 
much of the answer by the derivative of sino S at x; this part is casy-——it involves a 
composition of two functions, which we already know how to handle. We obtain, 
for the final answer, 


= . 2 ay 
i @) = 2st 2 © Ges sea 
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The following example is handled similarly. Suppose 


f(x) =sm(sin x2). 


Without even bothering to write down f as a composition gohok of three functions, 
we can see that the left-most one will be sm, so our expression for f’(x) begins 


fF Co 0s ) 
Inside the parentheses we must put the value of ho k(x); this is simply sin x? (what 
~ . . @] . ~ . . ~ . 
you get from sin(sin.x~) by deleting the first sin). So our expression for f’(*) begins 
4 : 5) 
J @® ScosGinx) 


m s = oe P cS a) 5 

We can now forget about the first sin in sm(sin.x7); we have to multiply what we 
. . . . . 2) . . 

have so far by the derivative of the function whose value at x is sin.x-— which is 

again a problem we already know how to solve: 


. ): 5] 
fF (&) = consin x 9" Cie 2., 


Finally, here are the derivatives of some other functions which are the composition 
of sm and S$, as well as some other triple compositions. You can probably just 
“see” that the answers are correct— if not, try writing out f as a composition: 


ft £G) Sin (sin x)*) then f’(x) = cos((sin x)*) -2 sin x -cosx 
f(x) = [sin(sin Ryle f &) = Z2sm(Girer)” cos(Gim en ose 
f(x) Ssim(sin(sm x )) f' (x) = cos(sin(sin.x)) - cos(sin x) - cos x 
hr) = sin? (x sin x) f' (x) = 2sin(x sim x) - cos(x sin x) 


- [sin x +.x cos x | 
. . nee a . pi Doe 
i 6 Sse sin 2)) i Go) =los(sin(e sit) costr” sume) 
. 9) 
- (2% sin x = a cos x]. 
The rule for treating compositions of four (or even more) functions 1s easy — 


always (mentally) put in parentheses starting from the right, 


f o(go(hok)), 


and start reducing the calculation to the derivative of a composition of a smaller 
number of functions: 


f(g (h(k(x)))) 
For example, if 


io sin? (sin? («)) Lf = Sos =:S © sin 
= S$ (sin > ($ » sin))] 


then 


. . ee), ae): . 
f («) = 25fGin* x) » Gos(sin- wx) - 2 sinw - GOsw: 
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if 
J) =smism x)?) [f=sne Sosmes 
= sin (So (imo 'S))| 
then 
f'(x) = cos((sin x7)*) - 2 sin x7 - cos.x? - 2x: 
ih, 
f (x) = sin? (sin(sin.x)) [fill in yourself, if necessary | 
then 


f(x) = 2sin(sin(sin x)) + cos(sin(sin x)) - cos(sin x) + cos x. 


With these examples as reference, you require only one thing to become a master 
diflerentiator—practice. You can be safely turned loose on the exercises at the end 
of the chapter, and it is now high time that we proved the Chain Rule. 

The following argument, while not a proof, indicates some of the tricks one 
might try, as well as some of the difficulties encountered. We begin, of course, 
with the definition— 


ON {es h)— fe) 
ha @ Ga) = him Cee tanga) —f oenta) 


10 h 
. f(gla+h)) — f(g(a)) 
= hn 
h>0 h 


Somewhere in here we would like the expression for g’(a@). One approach is to 
put it in by fiat: 

as F(glarbh)) — F(e@) _ an JG a + Tas Sa) ola tit) = g(a) 
h>0 h ~ oO g(a +h) — gla) h , 
This does not look bad, and it looks even better if we write 

. if C8) By Ome) 

ki. SS 

h—0 h 


— fm LEO 4 [8 +4) = g@)—-fe@) | sa +h) — sla) 
7-20 g(a +h) — g(a) h—0 h 


The second limit is the factor g’(a) which we want. If we let g(a +h) — g(a) =k 
(to be precise we should write k(/7)), then the first limit is 


ee ear £) — f(a(@)) 
hn, 
h—O k 


It looks as if this mit should be f’(g(a)), since continuity of g at a implies that k 
goes to 0 as ht does. In fact, one can, and we soon will, make this sort of reasoning 
precise. ‘There is already a problem, however, which you will have noticed if you 
are the kind of person who does not divide blindly. Even for h 4 0 we might have 
g(a +h) — g(a) = 0, making the division and multiplication by g(a +h) — g(a) 
meaningless. ‘True, we only care about small #, but g(a +) — g(a) could be 0 
for arbitrarily small h. The easiest way this can happen is for g to be a constant 
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THEOREM 9 (THE CHAIN RULE) 


PROOF 


fulichony g(x) = cM hen g(a -- &) —*e(a)t0 ior all &. In this case, f & ¢ 1s also 
a constant function, (f o g)(x) = f(c), so the Chain Rule does indeed hold: 


(fog) (a) =0= f'(g(a))- 8'(a). 
However, there are also nonconstant functions g for which g(a +/1) — g(a) = O 
for arbitrarily small i. For example, if a = 0, the function g might be 
Dee | 
xsi =, v0 
x 
0, noe 


g(x) = 


In this case, g’(O0) = 0, as we showed in Chapter 9. Ifthe Chain Rule is correct, we 
must have (f° g)’(O) = 0 for any diflerentiable f, and this is not exactly obvious. 
A proof of the Chain Rule can be found by considering such recalcitrant functions 
separately, but it is easier simply to abandon this approach, and use a trick. 


If g is differentiable at a, and f is differentiable at g(a), then f og is differentiable 
ated, alc 


(fee) Gut (@))- 2 @. 


Define a function ¢ as follows: 
J (g(a tui) — 7 (etay) 
o(h) = g(a +h) — g(a) 
f'(gla)), if g(ath)— g(a) =0. 


It should be intuitively clear that @ is continuous at 0: When /: is small, 


if g(a +h) — g(a) 40 


g(a +h) — g(a) is also small, so if g(a +h) — g(a) is not zero, then @(/) will 
be close to f’(g(a)); and if it is zero, then @(/) actually equals f’(g(a)), which 
is even better. Since the continuity of @ is the crux of the whole proof we will 
provide a careful translation of this intuitive argument. 

We know that f is differentiable at g(a). This means that 


y= 
lim y (tas ; f(g a)) Say 


Thus, if ¢ > 0 there is some number 6’ > 0 such that, for all k, 


(@) iO = \k| <8, then 7 


— f'(g(a))| <. 
Now g is differentiable at a, hence continuous at a, so there is a 6 > O such that, 
for all'?,, 

(2) if |h| <6, then 


g(at+h) — gla) < 8. 
Consider now ay h with \/r| <6.Ifk=—= g(a t+ kh) — g(a) = 0. then 


ate UG ie) — PEG)) _ fea) ake) 
g(a +h) — g(a) - k 


it follows from (2) that [A] < 6’, and hence from (1) that 


Ip(h) — f'(g(a))| <e. 


ph) = 
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On the other hand, if g(a + h) — g(a) = 0, then @(h) = f'(g(a)), so it is surely 
true that 


Ip(h) — f'(g(a))| < €. 
We have therefore proved that 
lim $(h) = f'(g(a)). 


so @ is continuous at 0. The rest of the proof is easy. If h 4 0, then we have 


Fig(a +h)) — f(g(a)) g(a+h) — gla) 
I pt iy © 
h h 
even if g(a +h) — g(a) = 0 (because in that case both sides are 0). Therefore 
5 mote h))—f : ( h)— gla 
(f > 2) (a) = lim I (gla + h)) — f(gla)) os lim p(h) . lim gla +h) — 8(a) 
h—-0 h h—-0 h—-0 h 


= f'(g(a))- g'(a). J 


Now that we can differentiate so many functions so easily we can take another 
look at the function 


2 4 - 
oe x*sin=, 3 #0 


QO, x:='0: 


In Chapter 9 we showed that f’(0) = 0, working straight from the definition (the 
only possible way). For x 4 0 we can use the methods of this chapter. We have 


aad . | , ] ( ] ) 
f(x) = 2x sin — +x*°cos—-|-—— }; 
x : 


Xx x 
Thus 
l 
7 era aS oe : 
Fie 2x sin 3 COs = x £0 
0, r= 0; 


As this formula reveals, the first derivative f’ is indeed badly behaved at 0—it is 
not even continuous there. If we consider instead 


f(x) = xesin-, x40 
fxy)= = 
then 
oe i 
f(x) = 3x°sin ROOST, x #0 
0, x =0. 


In this case f’ is continuous at 0, but f”’(0) does not exist (because the expres- 
. 2 > - . . . “11 . . 
sion 3x*sin 1/x defines a function which is differentiable at 0 but the expression 


—x cos 1/x does not). 
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As you may suspect, increasing the power of x yet again produces another 
improvement. If 
Ae oe 
fn x sin a x10 
iW). A: 


then 


I bg hel 
go= 4x° sin ~ F cosT. x #0 
. ew) 


It is easy to compute, right from the definition, that (f’)'(0) = 0, and f”(x) is 
easy to find for x 4 0: 
, | I ay 
ie }2x- sin — — 4x cos— —2xcos——sn—-, x #0 
f(x) = a x i x 
OD) Xa 


In this case, the second derivative f” 1s not continuous at 0. By now you may have 
guessed the pattern, which two of the problems ask you to establish: if 


l 
For x2" sin A eA 


O) a= 0, 
then f’(0),... , f(O) exist, but f is not continuous at 0; if 


| 
9 . 
Co sil ee 0 


fMm= 4° x 
0, ve) 


then f’(0),..., f° (O) exist, and f? is continuous at 0, but f is not differ- 
entiable at O. ‘hese examples may suggest that “reasonable” functions can be 
characterized by the possession of higher-order derivatives—no matter how hard 
we try to mask the mfinite oscillation of f(x) = sin 1/x, a derivative of sufficiently 
high order seems able to reveal the underlying irregularity. Unfortunately, we will 
see later that much worse things can happen. 

After all these mvolved calculations, we will brmg this chapter to a close with 
a mimor remark. It is often tempting, and seems more elegant, to write some of 
the theorems in this chapter as equations about functions, rather than about their 
values. Thus Theorem 3 might be written 


Cf 8) of eee 
Theorem 4 might be written as 
C7 nae a ne 
and ‘Theorem 9 often appears m the form 


Gf oe) — (fee) a. 
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Strictly speaking, these equations may be false, because the functions on the left- 


hand side might have a larger domain than those on the right. Nevertheless, this 


is hardly worth worrying about. If f and g are differentiable everywhere in their 


domains, then these equations, and others like them, ave true, and this 1s the only 


case any one Cares about. 


PROBLEMS 


}. 


As a warm up exercise, find f(x) for each of the following f. (Don’t worry 
about the domain of f or f”; just get a formula for f’(x) that gives the right 
answer when it makes sense.) 


(i) f(x) =sin(x +x”). 
Gi), f@) =sina sin x. 
bil) falr;) ==nSIM GOS st): 
(iv) f(x) = sin(sin 2 

B / GOS%, 
(v) f(x) = sin ( : iy 


sin(cos x) 


wi) foy= 
(vn) f(x) =sm(x +sinx). 
(vin) f(x) = sin(cos(sin.x)). 


Find f’(x) for each of the followmg functions f. (It took the author 20 min- 
utes to compute the derivatives for the answer section, and it should not take 
you much longer. Although rapid calculation is not the goal of mathematics, 
if you hope to treat theoretical apphcations of the Chain Rule with aplomb, 
these concrete applications should be child’s play mathematicians like to 


pretend that they can’t even add, but most of them can when they have to.) 


sin((x 26.1 )*(& + 2)). 
sin? (x2 + sin x). 


Gil) f(x) = sin?((x + sinx)?). 
Py 

(iv) f@)=sin 5 

cos .x- 
(vy) f(x) = sin(x sin.x) + sin(sin x2). 
(ee) (cosx 3, 
(Ul Oy sin? x sin x2 sin? x2. 
(a) May — sin? (sin7(sin.x)). 
pee Ci sin? x)°. 
(x) f(x) = sin(sin(sin(sin(sin .v)))). 
(xi) f(x) = sin((sin’ x? + 1)”). 
(xt hs) SM @e er) 0)? tow 
(xiii) f(x) = sin(x? + sin(x2 + sin x2)). 
(xiv) f(x) = sin(6 cos(6sin(6 cos 6x ))). 
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. 9 Wee 
sun x” sin Xx 


(xv) f(x) = l a sin x 
(xvi) f(x) = 
xvi) f(x) D 
Ni an 
x +sinx 
<0 
(xvii) f(x) =sin] . ( x3 
sin | — 
sin x 
x 
(xvi) f(x) = sin ini (—*] 
x —sinx 


Find the derivatives of the functions tan, cotan, sec, cosec. (You don’t have 
to memorize these formulas, although they will be needed once in a while; if 
you express your answers in the right way, they will be simple and somewhat 
symmetrical.) 


For each of the following functions f, find f’(f(x)) (not (f © f)’Cx)). 


- l 
(1) f(x) = ae 


(a) =f) =sin:x. 
(iii) f(x) = x?. 
(iv) fe) SALT: 


For each of the following functions f, find f(f’(x)). 


e ] 
a) s@)=— 


x 
(ii) f(x) =x?. 
an) fSe)y= 17, 
(iv) Ge) —alex: 


Find f’ in terms of g’ if 


(i) fie y=2@ g(a)). 
(ii) f(x) = g(x: g(a)). 
(ii) f(x) = g(x + g(x)). 
(iv) f(x) = e(x)(* — a). 
(v) f(x) =2(a)(%~—a). 
(vi) f(x +3) = g(x’). 


(a) A circular object is increasing in size in some unspecified manner, but it 
is known that when the radius is 6, the rate of change of the radius is 4. 
Find the rate of change of the area when the radius is 6. (If r(t) and A(t) 
represent the radius and the area at time ¢, then the functions r and A 
satisfy A = ar: a straightforward use of the Chain Rule is called for.) 
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11. 
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(b) Suppose that we are now informed that the circular object we have been 
watching is really the cross section of a spherical object. Find the rate 
of change of the volwne when the radius is 6. (You will clearly need to 
know a formula for the volume of a sphere; mm case you have forgotten, 
the volume is a0 times the cube of the radius.) 

(c) Now suppose that the rate of change of the area of the circular cross 
section is 5 when the radius is 3. Find the rate of change of the volume 
when the radius is 3. You should be able to do this problem in two 
ways: first, by using the formulas for the area and volume in terms of 
the radius: and then by expressing the volume in terms of the area (to 
use this method you will need Problem 9-3). 


mn) . . . . . Ch) ed) rn 
The area between two varying concentric circles is at all times 97 in-. The 
. . oe) . 
rate of change of the area of the larger circle is 10 in-/sec. How fast is the 
° ~ . . . eg) 
circumference of the smaller circle changing when it has area 167 m-? 
Particle A moves along the positive horizontal axis, and particle B along the 


graph of f(x) = ~J/3x,x <0. Ata certain time, A is at the point (5,0) 
and moving with speed 3 units/sec; and B is at a distance of 3 units from 
the origin and moving with speed 4 units/sec. At what rate is the distance 
between A and B changing? 


Let f(x) =x? sin 1/x for x 4 0, and let f(0) = 0. Suppose also that # and k 


are two functions such that 


h'(x) = sin7(sin(x + 1)) Mea: =wh ete) 
in( Opes KO}; =O. 


Find 
(1) (f of) (0): 
(iu) (ko f)’(0). 


(iii) @’(x*), where a(x) = h(x). Exercise great care. 
Find f’(Q) if 

g(x) sin 7 x £0 

0, | a0). 


{[@= 


and 


0) = ei) eG 


Using the derivative of f(x) = 1/x, as found in Problem 9-1, find (1/g)’(x) 
by the Chain Rule. 


(a) Using Problem 9-3, find f’(x) for —] <x < 1, if f(x) = V1 — x2. 

(b) Prove that the tangent line to the graph of f at (a. v1 — a2 ) intersects 
the graph only at that pomt (and thus show that the elementary geometry 
definition of the tangent Ime coincides with ours). 
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Prove similarly that the tangent lines to an ellipse or hyperbola intersect these 
sets only once. 


If f + g¢ 1s differentiable at a, are f and g necessarily differentiable at a? 
If f-g and f are diflerentiable at a, what conditions on f imply that g is 
differentiable at a? 


(a) Prove that if f is differentiable at a, then |f| is also differentiable at a, 
provided that f(a) € 0. 

(b) Give a counterexample if f(a) = 0. 

(c) Prove that if f and g are differentiable at a, then the functions 
max(f,g) and min(f, g) are differentiable at a, provided that f(a) 4 
g(a). 

(d) Give a counterexample if f(a) = g(a). 


Give an example of functions f and g such that g takes on all values, and fog 
and g are differentiable, but f isn’t differentiable. (The problem becomes 
trivial if we don't require that g takes on all values; g could just be a constant 
function, or a function that only takes on values in some interval (a,b), in 
which case the behavior of f outside of (a,b) would be irrelevant.) 


(a) If g = f? find a formula for g’ (involving f’). 

(b) If g = (f°, find a formula for g’ (involving f”). 

(c) Suppose that the function f > 0 has the property that 
l 

pe: 


Find a formula for f” m terms of f. (In addition to simple calculations, 


eas 


a bit of care is needed at one pomt.) 


If f is three times differentiable and f’(x) 4 0, the Schwarzean denwative of f 
at x is defined to be 


i) 3 ff @&) 2 
a = mee 
De a 5 (Fe) | 


(a) Show that 
(fog) = (Df og]-g*+Ag. 


x+l 
(b) Show that if f(r) = oe with ad — be £ 0, then Df = 0. Conse- 
Cx + ¢ 


quently, SC f og) = Dg. 


Suppose that f(a) and g(a) exist. Prove Letbniz’s formula: 


HN 


(fg) (a) =D ({) ray gi (a). 


k=0 


a 


225 


aay 
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Prove that if f(g(a)) and g(a) both exist, then (f o g) (a) exists. A 
little experimentation should convince you that it is unwise to seek a formula 
for (f og) (a). In order to prove that (f og)" (a) exists you will therefore 
have to devise a reasonable assertion about (f 0g) (a) which can be proved 
by induction. ‘Try something like: “(f 0 g))(a) exists and is a sum of terms 
each of which is a product of terms of the form... .” 


(a) Blt Cale ax” + a,x"! +4---+a9, find a function e sichithat.2 = f. 
Find another. 


(b) If 


find a function g with g’ = f. 
(c) Is there a function 


b 
A) 0 Ee ee a 
such that 7 (ce) = ee 
Show that there 1s a polynomial function f of degree n such that 


(a) f'(x) =0 for precisely 1 — 1 numbers x. 

(b) f’(x) = 0 for no x, if n is odd. 

(c) f’(x) = 0 for exactly one x, if m is even. 

(d) f’(x) =0 for exactly k numbers x, if n — k is odd. 
( 


a) ‘The number a is called a double root of the polynomial function f if 
Po) — er a)" g(x) for some polynomial function g. Prove that a is a 
double root of f if and only if a is a root of both f and f’. 

(b) When does f(x) = ax? + bx +c (a ¥ 0) have a double root? What does 
the condition say geometrically? 


If f is differentiable at a, let d(x) = f(x) — f’(a)(x —a) — f(a). Find d’(a). 


In connection with Problem 24, this gives another solution for Problem 9-20. 


This problem is a companion to Problem 3-6. Let aj,...,a, and bj,..., Dn 
be given numbers. 


(a) If xj,...,%» are distinct numbers, prove that there is a polynomial func- 
Uo f ofdesree 2n — I, such that jie — 7 O;) = 0 for 7 ~ 7% and 
4 (x;) =a; and, f'(x;) =b;. Hint: Remember Problem 24. 

(b) Prove that there is a polynomial function f of degree 2n —1 with f(x) = 
a; and f'(x;) = b; forall} 


Suppose that a and 6 are two consecutive roots of a polynomial function f, 
but that a and b are not double roots, so that we can write f(x) = 


(x — a)(x — b)g(x) where g(a) 4 0 and g(b) £ 0. 


(a) Prove that g(a) and g(b) have the same sign. (Remember that a and b 
are consecutive roots.) 
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(b) Prove that there is some number x with a < x < band f’(x) =0. (Also 
draw a picture to illustrate this fact.) Hint: Compare the sign of f’(a) 
and f’(b). 

(c) Now prove the same fact, even if @ and b are multiple roots. Hint: If 
f@) = =a)" (x — b)" g(x) where g(a) 4 0 and’ g(b) S$ 0, consider the 
polynomial funcuon h(x) = f’(x)/(x — ay ee) a 


‘This theorem was proved by the French mathematician Rolle, in connection 
with the problem of approximating roots of polynomials, but the result was 
not originally stated in terms of derivatives. In fact, Rolle was one of the 
mathematicians who never accepted the new notions of calculus. ‘This was 
not such a pigheaded attitude, in view of the fact that for one hundred years 
no one could define limits in terms that did not verge on the mystic, but on 
the whole history has been particularly kind to Rolle; his name has become 
attached to a much more general result, to appear in the next chapter, which 
forms the basis for the most important theoretical results of calculus. 


Suppose that f(*) = xg(x) for some function g which is continuous at 0. 
Prove that f is differentiable at 0, and find f’(O) in terms of g. 


Suppose f is differentiable at 0, and that f(0) = 0. Prove that f(x) = xg(x) 
for some function g which is continuous at 0. Hint: What happens if you try 
fo waite 2 (Oey f(x )/x? 
If f(x) =x” for n mN, prove that 
ak — © yonck 

(ERE 


= ta(" i 7 Vs for. A 0). 


FOC) =(-D 


Prove that it is impossible to write x = f(x)g(x) where f and g are differ- 
entiable and f (0) = g(0) = 0. Hint: Differentiate. 


What is f(x) if 


(a) f(x) =1/@ —a)"? 
*(b) f(y) =e foe =)? 


Let (Gr) =a" sinit/x if x 40, and let (0) = Or Prove that (0), .-., 
f'(O) exist, and that f is not continuous at O. (You will encounter the 
same basic difficulty as that in Problem 21.) 


Let f(x) = x7"! sin I/x if x 4 0, and let ((0) = 0. Prove'that f’(0),.... 
f(O) exist, that f is continuous at 0, and that f” is not differentiable 


ll Oi 
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In Leibnizian notation the Chain Rule ought to read: 


df(g(x)) _ df(y) dg(x) 
CY |, Oe 
Instead, one usually finds the following statement: “Let y = g(x) and 
I= F (y). Then dz dz dy” 
dx dy dx’ 


Notice that the z in dz/dx denotes the composite function f © g, while the z 
in dz/dy denotes the function f; it is also understood that dz/dy will be “an 
expression involving y,” and that in the final answer g(x) must be substituted 
for y. In each of the following cases, find dz/dx by using this formula; then 
compare with Problem 1. 

i) zg=siny, y=x+x?. 

(i) z=siny; y= cosx; 

(iii) z2=snu, «= sinx. 

Gv) <== sina US COS. «SIN. x3 


CHAPTER 
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x1 X2 X3 


MGUR Et 


THEOREM 1 


PROOF 


SIGNIFICANCE OF THE DERIVATIVE 


One aim in this chapter is to justify the time we have spent learning to find the 
derivative of a function. As we shall see, knowing just a little about f’ tells us a 
lot about f. Extracting information about f from mformation about f’ requires 
some difficult work, however, and we shall begin with the one theorem which is 
really easy. 

‘This theorem is concerned with the maximum value ofa function on an interval. 
Although we have used this term informally in Chapter 7, it is worthwhile to be 
precise, and also more general. 


Let f be a function and A a set of numbers contained in the domain of f. 


A point x in A is a maximum point for f on A if 


FQ)= fy) forever on 7A: 


The number f(x) itself is called the maximum value of f on A (and we also 
say that f “has its maximum value on A at x”). 


Notice that the maximum value of f on A could be f(x) for several different x 
(Figure 1); in other words, a function f can have several different maximum points 
on A, although it can have at most one maximum value. Usually we shall be 
interested in the case where A is a closed interval [a, b|; if f is continuous, then 
Theorem 7-3 guarantees that f does indeed have a maximum value on [a, b]. 

The definition of a minimum of f on A will be left to you. (One possible 
definition is the following: f has a minimum on A at x, if —f has a maximum 
on A at x.) 

We are now ready for a theorem which does not even depend upon the existence 
of least upper bounds. 


Let f be any function defined on (a, b). If x is a maximum (or a minmum) poimt 
for f on (a,b), and f is differentiable at x, then f’(x) = 0. 

(Notice that we do not assume differentiability, or even continuity, of f at other 
poimts.) 


Consider the case where f has a maximum at x. Figure 2 illustrates the simple idea 
behind the whole argument—-secants drawn through points to the left of (x, f(x) 
have slopes > 0, and secants drawn through points to the right of (x, f(v)) have 
slopes < 0. Analytically, this argument proceeds as follows. 
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If A is any number such that x + / is in (a,b), then 
bla) = F(x +h), 
since f has a maximum on (a, b) at x. ‘Vhis means that 
ft +h) — f(x) <0. 


Thus, if 4 > 0 we have 


; 0. 


and consequently 
aa) — J OC) 
CT re 


=): 
h—0Ot h 


On the other hand, if h < 0, we have 


faea se 20. 
1 


sO 
esl) — ; 
lim oS) = ee) ae) = il?) = 


(0). 
h>0- h 


By hypothesis, f is differentiable at x, so these two limits must be equal, in fact 
equal to f’(x). This means that 


oem md f(x) = 0, 


from which it follows that f’(x) = 0. 
The case where f has a minimum at x is left to you (give a one-line proof). ff 


Notice (Figure 3) that we cannot replace (a. b) by [a, b| in the statement of the 
theorem (unless we add to the hypothesis the condition that x is in (a, b).) 

Since f’(x) depends only on the values of f near x, it is almost obvious how to 
get a stronger version of Theorem |. We begin with a definition which 1s illustrated 
in Figure 4. 


Let f be a function, and A a set of numbers contained in the domain of f. 
A point x in A is a local maximum [minimum] point for f on A if 


there is some 6 > O such that x is a maximum [minimum] point for f on 
AN (x —6.x +65). 


If x is a local maximum or minimum for f on (a,b) and f is differentiable at x. 


fliem ty ix y= 10) 


You should see why this is an easy application of Theorem |. J 


190) Derwatives and Integrals 


DEFINITION 


a local 
Minimum point 


PEG URE 4 


X2 


a local 
maximum point 


The converse of Theorem 2 is definitely not rue —it is possible for f’(x) to be 0 
even if x is not a local maximum or minimum point for f. The simplest example 
is provided by the function f(«) = x?; in this case f’(0) = 0, but f has no local 
maximum or minimum anywhere. 

Probably the most widespread misconceptions about calculus are concerned 
with the behavior of a function f near x when f’(x) = 0. The point made in 
the previous paragraph is so quickly forgotten by those who want the world to be 


the 
condition f’(x) = 0 docs not imply that x 1s a local maximum or minimum point 


simpler than it is, that we will repeat it: the converse of Theorem 2 is not true 


of f. Precisely for this reason, special terminology has been adopted to describe 
numbers x which satisfy the condition f’(x) = 0. 


A critical point of a function f is a number x such that 


fe 0) 


The number f(x) itself is called a critical value of ff. 


The critical values of f, together with a few other numbers, turn out to be the 
ones which must be considered in order to find the maximum and minimum of a 
given function f. To the unmitiated, findmg the maximum and minimum value 
of a function represents one of the most intriguing aspects of calculus, and there 
is no denying that problems of this sort are fun (unul you have done your first 
hundred or so). 

Let us consider first the problem of finding the maximum or minimum of f 
on a closed interval [a,b]. (Then, if f 1s continuous, we can at least be sure 
that a maximum and minimum value exist.) In order to locate the maximum and 
miuumum of f three kinds of points must be considered: 


(1) ‘The critical pomts of f m [a, 5]. 
(2) The end pomts a and b. 
(3) Points x in [a,b] such that f is not differentiable at x. 


If x is amaximum point or a minimum point for f on [a,b], then x must be in one 
of the three classes hsted above: for if x is not in the second or thrd group, then 
x is in (a,b) and f zs differentiable at x; consequently f’(~) = 0, by Theorem 1, 
and this means that + ts m the first group. 

If there are many pomts in these three categories, finding the maxumum and 
minimum of f may still be a hopeless proposition, but when there are only a few 
eritical points, and only a few points where f is not diflerentiable, the procedure ts 
fairly straightforward: one simply finds f(x) for each x satisfying f’(x) = 0, and 
f(x) for each x such that f is not differentiable at x and, finally, f(a) and f(b). 
The biggest of these will be the maximum value of f, and the smallest will be the 
minimum. A simple example follows. 


GRY ees 


| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
} 
| 
I 
| 
| 


11, Significance of the Derivative 191 


Suppose we wish to find the maximum and mmimum value of the function 
7; 3 : 
Jigt=x> =X 
on the interval [—1, 2]. To begin with, we have 
3) 
yf (x = 3x6 — 1, 
so f’(x) = 0 when 3x7 — 1 = 0, that is, when 
fons or —/ 1/3. 
The numbers / 1/3 and —/ 1/3 both he in [—1, 2], so the first group of candidates 


for the location of the maximum and the minimum is 


GQ) eili/3, —a/ 1/3. 


The second group contains the end poimts of the interval, 
yee li. 2. 


The third group is empty, since f is differentiable everywhere. ‘The final step 1s to 
compute 


fV1/3) = V1/39 = JIB = 57173 - V1/3 = -§3V1/3, 

FeV sti =e) ew /3) = 5/1/73 3 = 
jo = 0 
fQ)=6. 


Clearly the mmmnum value ts =3/ 1/3, occurring at /1/3, and the maximum 
value is 6, occurring at 2. 

This sort of procedure, if feasible, will always locate the maximum and minmum 
value of a continuous function on a closed interval. If the function we are dealing 
with is not continuous, however, or if we are seeking the maximum or Miumum 
on an open interval or the whole line, then we cannot even be sure beforehand 
that the maximum and mmimum values exist, so all the mformation obtained by 
this procedure may say nothing. Nevertheless, a htthe ingenuity will often reveal 
the nature of things. In Chapter 7 we solved just such a problem when we showed 
that if 7 is even, then the function 


Oe a) eM ae C0 


has a minum value on the whole line. This proves that the minimum value must 
occur at some number x satisfying 


(= fem +e Da? re 


If we can solve this equation, and compare the values of f(x) for such x, we can 
actually find the mmimmum of f. One more example may be helpful. Suppose we 
wish to find the maximum and minimum, if they exist, of the function 

I 
1 — x? 


oe 
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on the open interval (—1!, 1). We have 


2x 


ae 


(1 = x2)2 


Paes 


so f’(x) = O only for x = 0. We can see immediately that for x close to | or —1 the 
values of f(x) become arbitrarily large, so f certainly does not have a maximum. 
This observation also makes it easy to show that f has a mmimum at 0. We just 
note (Figure 5) that there will be numbers a and b, with 


—<@=0 and O0<b <1, 
such that f(x) > f(O) for 
—i-emwaaq and b<x = 1. 


This means that the mmimum of f on [a,b] is the mmimum of f on all of 
(—1,1). Now on [a,b] the mintmum occurs either at 0 (the only place where 
f’ = 0), or ata or b, and a and b have already been ruled out, so the minimum 
valucis /(0) = 1. 

In solving these problems we purposely did not draw the graphs of f(r) = «3 —x 
and f(x) = 1/(1 — x7), but it is not cheating to draw the graph (Figure 6) as long 
as you do not rely solely on your picture to prove anything. ‘As a matter of fact, we 
are now going to discuss a method of sketching the graph ofa function that really 
gives enough formation to be used in discussmg maxima and minima—in fact 
we will be able to locate even /ocal maxuna and minima. This method involves 
consideration of the sign of f’(x), and relies on some deep theorems. 


The theorems about derivatives which have been proved so far, always yield 
information about f’ in terms of information about f. This is true even of Theo- 
rem |, although this theorem can sometimes be used to determme certain informa- 
tion about f, namely, the location of maxima and minima. When the derivative 
was first introduced, we emphasized that f’(x) is not [f(x +h) — f(x)]/h for any 
particular h, but only a limit of these numbers as ft approaches 0; this fact becomes 


painfully relevant when one tries to extract information about f from information 
about f’. The smplest and most frustrating illustration of the difficulties encoun- 
aa tered is afforded by the followmg question: If f’(x) = 0 for all x, must f be a 
Px? constant function? It is impossible to imagine how f could be anything else, and 


this conviction is strengthened by considering the physical interpretation — if the 
velocity of a particle is always 0, surely the particle must be standing still! Never- 
theless it is dificult even to begin a proof that only the constant functions satisfy 
i Se gee f'(x) = 0 for all x. The hypothesis f’(x) = 0 only means that 


. f@et+h — ff) 
lin ———_———_ 
h—0 h 


— ii) 


FIGURE 6 and it is not at all obvious how one can use the information about the hniit to 
derive information about the function. 
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FIGURE 8 


THEOREM 3 (ROLLE’S THEOREM) 
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The fact that f 1s a constant function if f’(x) = 0 for all x, and many other facts 
of the same sort, can all be derived from a fundamental theorem, called the Mean 
Value Theorem, which states much stronger results. Figure 7 makes it plausible 
that if f is differentiable on [a,b], then there 1s some x i (a,b) such that 


f(b) — fla) 


b—-a 


fa)= 


Geometrically this means that some tangent line is parallel to the line between 
(a, f(a)) and (b, f(b)). The Mean Value Theorem asserts that this 1s trtue—there 
is some x m (a,b) such that f’(x), the instantaneous rate of change of f at x, 1s 
exactly equal to the average or “mean” change of f on [a, b], this average change 
being [ f(b) — f(a@)|/[b — a]. (lor example, if you travel 60 miles in one hour, 
then at some time you must have been traveling exactly 60 miles per hour.) This 
theorem is one of the most important theoretical tools of calculus—probably the 
deepest result about derivatives. From this statement you might conclude that the 
proof is difficult, but there you would be wrong-—the hard theorems in this book 
have occurred long ago, in Chapter 7. It is true that if you try to prove the Mean 
Value ‘Theorem yourself you will probably fail, but this is neither evidence that the 
theorem 1s hard, nor something to be ashamed of. ‘The first proof of the theorem 
was an achievement, but today we can supply a proof which is quite simple. It 
helps to begin with a very special case. 


If f is continuous on [a,b] and differentiable on (a,b), and f(a) = f(b), then 
theresa numer © un (aw>) such that (’(x) = 0; 


If follows from the continuity of f on [a,b] that f has a maximum anda minimum 
value on [a, b}. 

Suppose first that the maximum value occurs at a pomt x m (a,b). Then 
f'(x) =0 by Theorem 1, and we are done (Figure 8). 

Suppose next that the minimum value of f occurs at some point +. m (a,b). 
Then, again, f’(x) = 0 by Theorem 1 (Figure 9). 

Finally, suppose the maximum and mmimum values both occur at the end 
points. Since f(a) = f(b), the maximum and minimum values of f are equal, 
so f is a constant function (Figure 10), and for a constant function we can choose 
any x in (a,b). fj 


Notice that we really needed the hypothesis that f is differentiable everywhere 
on (a,b) m order to apply Theorem 1. Without this assumption the theorem is 
false (Figure 11). 

You may wonder why a special name should be attached to a theorem as easily 
proved as Rolle’s Theorem. ‘The reason is, that although Rolle’s Theorem is a 
special case of the Mean Value ‘Vheorem, it also yields a simple proof of the Mean 
Value ‘Theorem. In order to prove the Mean Value Theorem we will apply Rolle’s 
Theorem to the function which gives the length of the vertical segment shown m 
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Figure 12; this is the difference between f(x), and the height at x of the line L 
between (a, f(a)) and (b, f(b)). Since L is the graph of 


ro) == 


(x —a) + fla), 


| i b-a 
a b 
we want to look at 
FIGURE 11 : 
nes 
ee a] (aaa) oat (a): 


As it turns out, the constant f(a) is irrelevant. 


THEOREM 4 (THE MEAN VALUE _ If f is continuous on [a,b] and differentiable on (a, b), then there is a number x 
THEOREM) in (a,b) such that 
f@)— f(a) 


b—a 


a) — 


PROOF Let 


A(x) = fa)—= ore (x — a). 


b-—a 


Clearly, / is continuous on [a,b] and differentiable on (a, b), and 


hay — i ay 


; (b) — fla) 
C2) h(b) = f(b) — eoeel (x —a) 
bea 
- = Kay 
Consequently, we may apply Rolle’s Theorem to /# and conclude that there ts 
(a. f(a)) some x in (a,b) such that 
lh / (b)— f(a) 
x 0.= ho=f G)— Nd eB 
v=o 
FIGURE 12 so that 


Le f(by— INE j 


b-a 


fia 


Notice that the Mean Value Theorem still fits into the pattern exhibited by 
previous theorems -information about f yields information about f’. ‘Vhis mfor- 
mation is so strong, however, that we can now go in the other direction. 


COROLLARY 1 If f is defined on an interval and f’(x) = 0 for all x in the interval, then f is 
constant on the interval. 


PROOF Let a and b be any two points in the interval with a 4 b. Then there is some x in 


FIGURE 13 


COROLLARY 2 


PROOF 


DEFINITION 


COROLLARY 3 


PROOF 
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(a,b) such that 
HD) — f(a) 


b-—a 


fo) = 


But f’(x) = 0 for all x in the interval, so 


_ f(b) = fla) 


0 
b-a 


5 


and consequently f(a) = f(b). Thus the value of f at any two points in the 
interval is the same, Le., f is constant on the interval. 


Naturally, Corollary | does not hold for functions defined on two or more in- 
tervals (Figure 13). 


If f and g are defined on the same interval, and f’(x) = g’(x) for all x in the 
interval, then there is some number c such that f = g +c. 


For all x in the interval we have (f — g)'(x) = f’(x) — g(x) = 0 so, by Corollary 1, 
there is a number c such that f —g =c. J 


The statement of the next corollary requires some terminology, which is illus- 
trated in Figure 14. 


A function is increasing on an interval if f(a) < f(b) whenever a and b are 
two numbers in the interval with a < b. The function f is decreasing on 
an interval if f(a) > f(b) for all a and b im the interval with a < b. (We 


often say simply that f is increasing or decreasing, in which case the interval is 


g, 
understood to be the domain of ff.) 


If f(x) > O for all x in an interval, then f is ncreasing on the interval; if f’(x) < 0 
for all x im the interval, then f is decreasing on the interval. 


Consider the case where f’(x) > 0. Let a and b be two pomts in the interval with 
a < b. Then there is some x in (a,b) with 


P "b) — f@ 
i (x) — pO as 
b—-—a 

But) Ge Wier all 1m (a. b), so 


age! (Co) 
—_______. > 


b-a 


0. 


Since b —a > 0 it follows that f(b) > f(a). 
The proof when f’(x) < 0 for all x is left to you. J 
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Notice that although the converses of Corollary | and Corollary 2 are true (and 
obvious), the converse of Corollary 3 is not true. If f is increasing, it 1s easy to 
see that f’(x) > 0 for all x, but the equality sign might hold for some x (consider 
Ti) ea 

Corollary 3 provides enough mformation to get a good idea of the graph of 
a function with a minimal amount of pot plotting. Consider, once more, the 


function f(x) =x? — x. We have 


f (ay 3x- — 1. 


We have already noted that f’(x) = 0 for x = ¥1/3 and x = —J/1/3, and it is 
also possible to determine the sign of f’(x) for all other x. Note that 3x7 — 1 > 0 
precisely when 


8x° 
se l 
i => Bi 
| x sy or ee = lS 
1 : ks thus 3x2 — | <0 precisely when 
(a) an increasing function ayia 13, 


Thus f is increasmg for x < —/J1/3, decreasing between —/1/3 and 1/3, 
and once again increasing for x > /1/3. Combining this information with the 
following facts 


Cl) efi =/ Wen ox /1/35 
fO/1/3) = -§V1/3. 


! ! : Q)*/ = eitra —=1, Oe 


(3) f(x) gets large as x gets large, and large negative as x gets large negative, 


eS) TSS) 


(b) a decreasing function 
FIGURE 14 it is possible to sketch a pretty respectable approximation to the graph (Figure 15). 
By the way, notice that the intervals on which f increases and decreases could 
have been found without even bothermg to examine the sign of f’. For example, 
since f’ is contmuous, and vamshes only at —¥ 1/3 and /1/3, we know that f’ 
always has the same sign on the interval (—V 1/3, /1/3). Since f(—v1/3) > 
f(V 1/3), it follows that f decreases on this interval. Similarly, f’ always has the 
same sign on (/ 1/3, 00) and f(x) is large for large x, so f must be increasing on 
(/1/3, 00). Another point worth noting: If f’ is continuous, then the sign off’ 
on the interval between two adjacent critical points can be deternuned simply by 
finding the sign of f(x) for any one x m this mterval. 


Our sketch of the graph of f(x) = x* — x contains sufficient information 
to allow us to say with confidence that —/ 1/3 is a local maximum poimt, and that 


JV 1/3 is a local minimum pomt. In fact, we can give a general scheme for deeid- 
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f@)=x-—x 


f increasing f decreasing f increasing 


| | 
| | 
! | 
! \ 
! | 
i 


FIGURE 15 


ing whether a critical point is a local maximum point, a local minimum point, or 
neither (Figure 16): 


(1) if f’ > 0 in some interval to the left of x and f’ < 0 in some interval to 
the right of x, then x is a local maximum point. 

(2) if f’ < 0 in some interval to the left of x and f’ > 0 in some interval to 
the right of x, then x is a local minimum point. 

(3) if f’ has the same sign in some interval to the left of x as it has in some 
interval to the right, then x is neither a local maximum nor a local minimum 
point. 


(There is no point in memorizing these rules—you can always draw the pictures 
yourself.) 

The polynomial functions can all be analyzed in this way, and it is even possible 
to describe the general form of the graph of such functions. ‘To begin, we need a 


+ 

x x x 
os + + — + 
f' <0 f'>0 PSH SO f' <0 f' <9 

(b) (c) (d) 


FIGURE 16 
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result already mentioned in Problem 3-7: If 


! 


F(X) = Gyx" + dy-1xX"~" + +++ + a0, 


then f has at most n “roots,” 1.e., there are at most m numbers x such that 
f(x) = 0. Although this is really an algebraic theorem, calculus can be used 
to give an casy proof. Notice that if x; and x2 are roots of f (Figure 17), so that 
f(a) = fQe2) = 0, then by Rolle’s Theorem there is a number x between x, 
and x2 such that f’(+) = 0. This means that if f has k different roots x; < x2 < 

- < xz, then f’ has at least k — 1 different roots: one between x; and x2, one 
between x2 and x3, ete. It is now easy to prove by induction that a polynomial 


Gi RIE 7 function 


| 


fAX) = ene tr Gee a aes IO 


has at most n roots: The statement is surely true for n = 1, and if we assume that 
it is true for n, then the polynomial 


BOX) = dy py x"t! + dy x" +++ + bo 


could not have more than n + | roots, since if it did, g’ would have more than n 
roots. 
With this mformation it is not hard to describe the graph of 


fe) Sayx" + a,x"! +--+. +p. 


The derivative, being a polynomial function of degree n — 1, has at most 
n— 1 roots. Therefore f has at most 2 — | critical points. Of course, a criti- 
cal point is not necessarily a local maximum or minimum point, but at any rate, 
if a and b are adjacent critical points of f, then f’ will remain either positive or 
negative on (a,b), since f’ is continuous; consequently, f will be either creasing 
or decreasing on (a,b). Thus f has at most 1 regions of decrease or crease. 


As a specific example, consider the function 
3 4) 
EGY = ee 
Since 


dy = 4x? — 4x = 4x(x —1)@ +1), 


the critical points of f are —1, 0, and 1, and 
1(0, 0) f(-D) =-1. 
ae” ino) 
7 hs fd) =-1. 
7 | im 
= | ° a oF a ; 
(Alea | (J,—1) The behavior of f on the intervals between the critical points can be determined 


by one of the methods mentioned before. In particular, we could determine the 
sien of f’ on these intervals simply be examining the formula for f’(x). On the 
other hand, from the three critical values alone we can see (igure 18) that f 
increases on (—1,0) and decreases on (QO. 1). To determine the sign of f’ on 


FIGURE 18 
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(—oo, —1) and (1, 0) we can compute 


and conclude that f is decreasing on (—oo, —1) and increasing on (1, 00). These 
conclusions also follow from the fact that f(x) is large for large x and for large 
negative x, 

We can already produce a good sketch of the graph; two other pieces of infor- 
mation provide the finishing touches (Figure 19). First, it is easy to determine that 
f@) =0 toga, sen! 2. second, it is clear that f is even, f(x) = f(—+x), so the 
graph is symmetric with respect to the vertical axis. The function f(x) = x? — x, 
already sketched in Figure 15, is odd, f(x) = —f(— x), and is consequently sym- 
metric with respect to the origin. Half the work of graph sketching may be saved 
by noticing these things in the beginning. 


me) x 25 


(-1,-1) cn! 


PGR Ene 


Several problems in this and succeeding chapters ask you to sketch the graphs 
of functions. In each case you should determine 


1) the critical points of f, 

(2) the value of f at the critical points, 

(3) the sign of f’ in the regions between critical points (if this is not already 
clear), 


(4) the numbers x such that f(+) = 0 (if possible), 
(5) the behavior of f(x) as x becomes large or large negative (if possible). 


Finally, bear in mind that a quick check, to see whether the function is odd or 
even, may save a lot of work. 

This sort of analysis, if performed with care, will usually reveal the basic shape 
of the graph, but sometimes there are special features which require a little more 
thought. It is impossible to anticipate all of these, but one piece of information is 
often very important. If f is not defined at certain points (for example, if f 1s a 
rational function whose denominator vanishes at some points), then the behavior 
of f near these points should be determined. 

lor example, consider the function 
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which is not defined at 1. We have 


(= Cr) Ga) 


fi@)= - ap 
ee 2) 
gi be 
Thus 
(1) the critical points of f are 0, 2. 
Moreover, 
(2) f(O) = —2, 
wn 


Because f is not defined on the whole interval (0, 2), the sign of f’ must be 
determined separately on the intervals (0, 1) and (1, 2), as well as on the intervals 
(—oo, 0) and (2, oo). We can do this by picking particular points in each of these 
intervals, or simply by staring hard at the formula for f’. Either way we find that 


Gyfa)ys0 1 saa! 
i @ic.D 0b. wal. 
fap=0 i 1 =<te= Be 
Lap) > O gil @ @= x. 


Finally, we must determine the behavior of f(x) as x becomes large or large 
negative, as well as when x approaches | (this information will also give us another 
way to determine the regions on which f increases and decreases). ‘lo examine 
the behavior as x becomes large we write 


x*—2x +2 | 
———$— =x-l+—; 
ed, x-] 
clearly f(x) is close to x — | (and slightly larger) when x is large, and f(x) is close 
to x — | (but slightly smaller) when x is large negative. ‘The behavior of f near | 
is also easy to determine; since 


lim (x* Dy 2 0, 


the fraction 
x*—2x+42 
x-—1 

becomes large as x approaches | from above anid large negative as x approaches | 
from below. 

All this information may seem a bit overwhelming, but there is only one way 
that it can be pieced together (Figure 20); be sure that you can account for each 
feature of the graph. 


When this sketch has been completed, we might note that it looks hke the graph 


THEOREM 5 


PROOF 
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FIGURE 20 


of an odd function shoved over | unit, and the expression 


9 9 
toate (=) ed 
ee x—! 
shows that this is indeed the case. However, this is one of those special features 


which should be investigated only after you have used the other information to get 
a good idea of the appearance of the graph. 

Although the location of local maxima and minima of a function is always re- 
vealed by a detailed sketch of its graph, it is usually unnecessary to do so much 
work. ‘There is a popular test for local maxima and minima which depends on the 
behavior of the function only at its critical points. 


Suppose f’(a) = 0. If f"(a) > 0, then f has a local minimum at a; if f(a) < 0, 
then f has a local maximum at a. 


By definition, 


, . ££ (a+h)—f@ 
f @) iO h 


Since f’(a) = 0, this can be written 


‘tied 
f(a) = lim Rey 
h—-0 h 
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i) a 


(c) 


FIGURE 21 


THEOREM 6 


PROOF 


Suppose now that f"(a) > 0. Then f'(a + /)/h must be positive for sufficiently 
small h. ‘Therefore: 


f(a +h) must be positive for sufficiently small h > 0 
and f’(a +h) must be negative for sufficiently small h < 0. 


‘This means (Corollary 3) that f is increasing in some interval to the right of a 
and f is decreasing in some interval to the left of a. Consequently, f has a local 
minimum at a. 

The proof for the case f’(a) < 0 is similar. J 


Theorem 5 may be applied to the function f(x) = x3 — x, which has already 


been considered. We have 


f'(x) = 3x? -1 
a OW 


At the critical points, — / 1/3 and /1/3, we have 


fie. a Soy We 0. 
Wey ea 0 


Consequently, — /1/3 is a local maximum point and /1/3 is a local minimum 
poimt. 

Although Theorem 5 will be found quite useful for polynomial functions, for 
many functions the second derivative is so complicated that it is easier to consider 
the sign of the first derivative. Moreover, if a is a critical point of f it may happen 
that f”(a) = 0. In this case, Theorem 5 provides no information: it is possible 
that a is a local maximum point, a local mmimum poimt, or neither, as shown 
(Figure 21) by the functions 


fx)= So) == AS me 


in each case f’(0) = f"(0) = 0, but 0 is a local maximum point for the first, a 
local minimum point for the second, and neither a local maximum nor minimum 
point for the third. This point will be pursued further in Part TV. 

It is interesting to note that ‘Vheorem 5 automatically proves a partial converse 
of itself: 


Suppose f(a) exists. If f has a local minimum at a, then f"(a) > 0; if f has a 
local maximum at a, then f"(a) < 0. 


Suppose f has local minunum at a. If f"(a) < 0, then f would also have a 
local maxunum at a, by Vheorem 5. ‘Vhus f would be coustant i some interval 
contaming a, so that f”(a) = 0, a contradiction. Vhus we must have f”(a) => 0. 


The case of a local maximuni is handled similarly. J 


LGA R IE, 7292 


THEOREM 7 


PROOF 
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(This partial converse to Theorem 5 is the best we can hope for: the > and < 
sigs cannot be replaced by > and <, as shown by the functions f(x) = ++ and 
{[@). 

The remainder of this chapter deals, not with graph sketching, or maxima and 
minima, but with three consequences of the Mean Value Theorem. The first is a 
simple, but very beautiful, theorem which plays an mportant role in Chapter 15, 
and which also sheds ight on many examples which have occurred in previous 
chapters. 


Suppose that f is continuous at a, and that f’(v) exists for all x in some interval 
containing a, except perhaps for x = a. Suppose, moreover, that lim f’(x) exists. 


A 


Then f'(a) also exists, and 


aie lim f Cie 


X—>a 


By definition, 
: (G@utdb ee f (a) 
7 (a) = lim J M@oatecliduas F(a) 
h—>0 h 
For sufficiently small h > 0 the function f will be continuous on fa,a +h] and 
diflerentiable on (a,a +h) (a similar assertion holds for sufficiently small hf < 0). 
By the Mean Value Theorem there 1s a number @ in (a,a + /) such that 


af Cea | (a) 
h 
Now a, approaches a as fh approaches 0, because a, is m (a,a +h); since 
lim f’(x) exists, it follows that 


3G 0! 


= f' (an). 


} — 
T'(a) tli ee eat 
h—0 h h-O x—a 


(It is a good idea to supply a rigorous ¢€-5 argument for this final step, which we 
have treated somewhat informally.) § 


Even if f is an everywhere differentiable function, it is still possible for f’ to be 
discontinuous. ‘This happens, for example, if 


ee ; 
oe x sm)— ea) 


O. ve — Op 


According to ‘Theorem 7, however, the graph of f’ can never exhibit a disconti- 
nuity of the type shown in Figure 22. Problem 61 outlines the proof of another 
beautiful theorem which gives further information about the function f’, and Prob- 
lem 62 uses this result to strengthen Theorem 7. 

The next theorem, a generalization of the Mean Value Theorem, is of interest 


mainly because of its applications. 
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THEOREM 8 (THE CAUCHY MEAN 
VALUE THEOREM) 


PROOF 


THEOREM 9 (L’HOPITAL’S RULE) 


If f and g are continuous on [a,b] and differentiable on (a,b), then there is a 
number x m (a,b) such that 


[f(b) — f@]g') = [g) — g@] f'(). 

(If g(b) 4 g(a), and g’(x) # 0, this equation can be written 

fb)= fla) f'@) 

g(b)— g(a) g(x) 
Notice that if g(x) = x for all x, then g’(x) = 1, and we obtam the Mean Value 
Theorem. On the other hand, applying the Mean Value Theorem to f and g 
separately, we find that there are x and y in (a, b) with 

f(b) = fla) f'@). 

g(b)— g(a) g’(y) 
but there is no guarantee that the x and y found in this way will be equal. ‘Vhese 


remarks may suggest that the Cauchy Mean Value Theorem will be quite difficult 
to prove, but actually the simplest of tricks suffices.) 


et 
h(x) = f@)[g®) - g(@)] —s@) fF) — fa). 
Then / is continuous on [a, b], differentiable on (a,b), and 
h(a) = f(a)g(b) — g(a) f(b) =h(d). 


It follows from Rolle’s ‘Theorem that h’(x) = 0 for some x in (a, b), which means 
that 
0= f'()[g) — g(a] -—g@[fb)—- f@).t 


The Cauchy Mean Value Theorem is the basic tool needed to prove a theorem 
which facilitates evaluation of hmits of the form 


when 


lim f (x) =,0,.and. line Gy i= 0: 


xa 
In this case, Theorem 5-2 is of no use. Every derivative is a hmit of this form, and 
computing derivatives frequently requires a great deal of work. Ifsome derivatives 
are known, however, many limits of this form can now be evaluated easily. 


Suppose that 
lim f(x)=0 and hm g(x) =0, 


Yaa 


and suppose also that lim f’(x)/g’(x) exists. Phen hin f(x)/g(¥) exists, and 
35 til A a( i! 


se Ce) 
| ai) 
ra g (xs) Ned (i) 


(Notice that Theorem 7 is a special case.) 


PROOF 
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The hypothesis that lim f’(x)/g'(x) exists contains two implicit assumptions: 
Air ad t 


(1) there is an interval (a —5,a+5) such that f’(x) and g’(x) exist for all x in 
(a = 0} G0) excepttpernaps tor + = a, 

(2) in this interval g(x) #4 O with, once again, the possible exception 
of « =e 


On the other hand, f and g are not even assumed to be defined at a. If we define 
f(a) = g(a) = O (changing the previous values of f(a) and g(a), if necessary), 
then f and g are continuous at a. If a < x < a+, then the Mean Value 
Theorem and the Cauchy Mean Value Theorem apply to f and g on the interval 
[a,x] (and a similar statement holds for a —6 < x <a). First applying the Mean 
Value Theorem to g, we see that g(x) 4 0, for if g(x) = 0 there would be some x 
in (a,x) with g’(x;) = 0, contradicting (2). Now applying the Cauchy Mean Value 


Theorem to f and g, we see that there is a number a@, in (a,x) such that 


[ f(x) — O]g’(ax) = [g(x) — 0] f’ (ax) 


or 
ce) — fo) 
BS) GR, 


Now a, approaches a as x approaches a, because a@, is in (a,x); since we are 
assuming that lim f’(y)/g’(y) exists, it follows that 
ya 
Gon. eACoee ere 


lim = lim = hm : 
xa g(x) x—a g' (ay) ya g'(y) 


(Once again, the reader is invited to supply the details of this part of the argu- 
ment.) § 


PROBLEMS 


1. For each of the following functions, find the maximum and minimum values 
on the indicated intervals, by finding the points in the interval where the 
derivative 1s 0, and comparing the values at these points with the values at 
the end points. 


Gi) f(x) =a =x? —8x+1 on [-2,2]. 


Gi) f@)=xeP+x41 oul til: 
(iil) *f (x) "3x P Gx? Fon [-5. 5]. 
(iv) f@)= 3 on [—3, 1]. 
) fossa on [-1. 4]. 
(fa on [0, 5]. 
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2. 


4. 


Now sketch the graph of each of the functions in Problem 1, and find all 
local maximum and minimum points. 


Sketch the graphs of the following functions. 


( 

Gy 
(11) 
‘iv) 


(a) 


*(b) 


l 
f(x)=x4+-. 
x 
3 
ENS) = he 
x7 
z 
f(x) = 
{o)= >. 
xe 
f(x) = —_>:- 
| tae 
n 
~ ~ ak, 8 7 ye ? 
If ay < +--+ < ay, find the minimum value of f(x) = ) (x —aj)~. 
i=l 
n 
Now find the minimum value of f(x) = “Fe |x —a;|. This is a problem 


i=l 
where calculus won't help at all: on the intervals between the a;’s the 


function /f is linear, so that the minimum clearly occurs at one of the a;, 
and these are precisely the points where f is not differentiable. However, 
the answer is easy to find if you consider how f(x) changes as you pass 
from one such interval to another. 

Let a > 0. Show that the maximum value of 


= + 
PO) EE Sal 


is (2+ a)/(1 +a). (The derivative can be found on each of the intervals 
(—o0, 0), (0, a), and (a, 0%) separately.) 


For each of the following functions, find all local maximum and minimum 
points. 

e *% #3597759 

a xO 
Q)) 6fitoS —3,. + =5 

Se a 

i x9: 


(V) 


Q, x irrational 
l/q,. Xx = p/q in lowest terms. 


f(x) x, x rational 
5) — j : 
‘ Q, x irrational. 
1, x =1/n for some nin N 
0, otherwise. 


|, if the decimal expansion of x contains a 5 
0, otherwise. 


(1, Db) 


TlG WERE 3 


NO, 


h 


Surface area is the 


sum of these areas | ~~, 


FIGURE 24 > 


FIGURE 25 


BM 


£2. 


TS; 
14. 


15: 
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Prove the following (which we often use implicitly): If f is mcreasing on (a, b) 
and continuous at a and b, then f is increasing on [a, b]. In particular, if f 
is continuous on [a,b] and f’ > 0 on (a,b), then f is increasing on [a, b]. 


A straight line is drawn from the point (0,a) to the horizontal axis, and 
then back to (1,6), as in Figure 23. Prove that the total length is shortest 
when the angles @ and f are equal. (Naturally you must bring a function 
into the picture: express the length in terms of x, where (x, Q) is the point 
on the horizontal axis. ‘The dashed line in Figure 23 suggests an alternative 
geometric proof; in either case the problem can be solved without actually 
finding the point (x, 0).) 


(a) Let (xo. yo) be a point of the plane, and let L be the graph of the function 
f(x) =mx +b. Find the point + such that the distance from (x9, yo) to 
(x, f(X)) is smallest. [Notice that mimimizing this distance is the same as 
minimizing its square. ‘This may simplify the computations somewhat. | 

(b) Also find x by noting that the line from (x9, yo) to (¥, f(*)) 1s perpen- 
dicular to L. 

(c) Find the distance from (xg, yo) to L, 1.e., the distance from (x9, vo) to 
(x, f(¥)). [It will make the computations easier if you first assume that 
b = 0; then apply the result to the graph of f(*) = mx and the point 
(xo, Yo — b).] Gompare with Problem 4-22. 

(d) Consider a straight line described by the equation Ax + By +C = 

0 (Problem 4-7). Show that the distance from (x9, yo) to this line is 


(Axo + Byo + C)/V A2 + B?. 


The previous Problem suggests the following question: What is the relation- 
ship between the critical points of f and those of f?? 


Prove that of all rectangles with given perimeter, the square has the greatest 
area. 


Find, among all right circular cylinders of fixed volume V, the one with 
smallest surface area (counting the areas of the faces at top and bottom, as 
in Figure 24). 


A right triangle with hypotenuse of length a is rotated about one of its legs 
to generate a right circular cone. Find the greatest possible volume of such 
a cone. 


Show that the sum of a positive number and its reciprocal is at least 2. 


Find the trapezoid of largest area that can be inscribed in a semicircle of 
radius a, with one base lying along the diameter. 


‘Two hallways, of widths a and b, meet at right angles (Figure 25). What 
is the greatest possible length of a ladder which can be carried horizontally 
around the corner? 
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FIGURE 30 


A garden is to be designed in the shape of a circular sector (Figure 26), with 
radius R and central angle 6. The garden is to have a fixed area A. For 
what value of R and 6 (in radians) will the length of the fencing around the 
perimeter be minimized? 


A right angle is moved along the diameter of a circle of radius a, as shown 
in Figure 27. What is the greatest possible length (A + B) intercepted on it 
by the circle? 


Ecological Ed must cross a circular lake of radius | mile. He can row across 
at 2 mph or walk around at 4 mph, or he can row part way and walk the 
rest (figure 28). What route should he take so as to 


(i) | see as much scenery as possible? 
(ii) cross as quickly as possible? 


(a) Consider points A and B on a circle with center O, subtending an angle 
of a = ZAOC (Figure 29). How must B be chosen so that the sum of 
the areas of AAOB and ABOC is a maximum? Hint: Express things 
in terms of 6 = ZAOB. 

(b) Prove that for n > 3, of all n-gons inscribed in a circle, the regular n-gon 
has maximum area. 


The lower right-hand corner of a page is folded over so that it just touches 
the left edge of the paper, as in Figure 30. If the width of the paper is @ and 
the page is very long, show that the minimum length of the crease is 3V3a/4. 


Figure 31 shows the graph of the derivative of f. Find all local maximum and 
minimum points of /f. 


FIGURE 31] 


Suppose that f is a polynomial function, f(x) = x" + a,—-jx""! +++» + a0, 
with critical pots —1, 1, 2, 3, 4, and corresponding critical values 6, 1, 2, 
4,3. Sketch the graph of f, distinguishing the cases 7 even and n odd. 


(a) Suppose that the critical pomts of the polynomial function f (+) = +" + 
Gy_{X"_ | + )---hagvares=al,al, 203, and f’(—1) = 0, Fb = 0, //Qa 
0, f"(3) = 0. Sketch the graph of f as accurately as possible on the 
basis of this information. 

(b) Does there exist a polynomial function with the above properties, except 
that 3 is not a critical point? 


Describe the graph of a rational function (in very general terms, similar to 
the text’s deseription of the graph of a polynomial function). 


25. 


26. 


B27. 


28. 


29. 


30. 
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(a) Prove that two polynomial functions of degree m and n, respectively, 
intersect in at Most max(m, 7) points. 

(b) For each m and nv exhibit two polynomial functions of degree m and n 
which intersect max(in,n) times. 


Suppose f is a polynomial function of degree n with f > 0 (son must be 
even). Prove that f + f’+ f” +-.-+ f™ >= 0. 


(a) Suppose that the polynomial function f(x) = x” + a,-yx""! +--+ + a9 
has exactly k critical points and f(x) 4 0 for all critical pots +. Show 
that n — k is odd. 

(b) For each n, show that if n —k is odd, then there is a polynomial function 
f of degree n with k critical points, at each of which f” is non-zero. 

(c) Suppose that the polynomial function f(x) = x" + Ae = a ee 
has k; local maximum points and k2 local minimum points. Show that 
ky =k, + 1 if n is even, and kp = ky, if n is odd. 

(d) Let n, ki, k2 be three integers with kz =k, + Lif n is even, and kz = k, if 
nis odd, and ky + kz <n. Show that there is a polynomial function f of 
degree n, with ky local maximum points and kz local minimum points. 

ky +k 
lint Piek a) <a> <--- < @,4,,'and toy (4 I] (x —a;)-( ae) 
for an appropriate number /. Zz 


(a) Prove that if f’(x) => M for all x in [a,b], then f(b) => f(ad+M(b—-a). 
(b) Prove that if f’(x) < M for all x in [a, b], then f(b) < f(a)+M(b—-a). 
(c) Formulate a similar theorem when | f’(x)| < M for all x in [a,b]. 


Suppose that f’(x) > M > 0 for all x in [0, 1]. Show that there is an interval 
of length i on which | f| > M/4. 


(a) Suppose that f’(x) > g’(x) for all x, and that f(a) = g(a). Show that 
f@®) > gs) tort atand G2 6 alorr "a. 

(b) Show by an example that these conclusions do not follow without the 
hypothesis f(a) = g(a). 

(c) Suppose that f(@) = g(a), that f’(x) > g’(x) for all x, and that f’(x0) > 
2’ (xo) for same X@ auc, Show that f(x) > et) for all x > xo. 


Find all functions f such that 


ie (= sine 

(by Fx) Sox? 

(i) f° OY= xe 

Although it is true that a weight dropped from rest will fall s(7) = 1612 
feet after f seconds, this experimental fact does not mention the behavior of 
weights which are thrown upwards or downwards. On the other hand, the 
law s(t) = 32 is always true and has just enough ambiguity to account for 
the behavior of a weight released from any height, with any initial velocity. 
kor simplicity let us agree to measure heights upwards from ground level: 
in this case velocities are positive for rising bodies and negative for falling 
bodies, and all bodies fall according to the law s”(1) = —32. 
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O35. 
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35. 


36. 
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(a) Show that s is of the form s(t) = — 16127 +ar+ Bh 

(b) By setting ¢ = O in the formula for s, and then in the formula for 5’, 
show that s(t) = —16t7 + vot +9, where so is the height from which the 
body is released at time O, and vo is the velocity with which it is released. 

(c) A weight is thrown upwards with velocity v feet per second, at ground 
level. How high will it go? (“How high” means “what is the maximum 
height for all times”.) What is its velocity at the moment it achieves its 
greatest height? What is its acceleration at that moment? When will it 
hit the ground again? What will its velocity be when it hits the ground 
again? 


A cannon ball is shot from the ground with velocity v at an angle @ (Fig- 
ure 32) so that it has a vertical component of velocity vsina@ and a hori- 
zontal component v cos@. Its distance s(t) above the ground obeys the law 
s(t) = —16r? + (vsina)r, while its horizontal velocity remaims constantly 
UCcOsa@. 


(a) Show that the path of the cannon ball is a parabola (find the position at 
each time f, and show that these points lie on a parabola). 
(b) Find the angle a which will maximize the horizontal distance traveled 


by the cannon ball before striking the ground. 
(a) Give an example of a function f for which hm f(x) exists, but 
XO 
lim f’(x) does not exist. 
X00 
(b) Prove that if lim f(x) and lm f’(x) both exist, then lim f’(x) = 0. 
iis ue ae. 
(c) Preverthat if lim’ f Gx) exists and him /"(x) exists, then lim_ f"(x) = 0: 
X¥— OO ESAS) XO 
(See also Problem 20-22.) 


Suppose that f and g are two diflerentiable functions which satisfy 
pee YF oe = 10.JProve' tliat af “flay = 0 and e(a) + 0,7 them fi.) = 0 tor all x. 
in an interval around a. Hint: On any interval where f/g is defined, show 
that it is constant. 


Suppose that | f@) — fG)| < |x — y|" fora > 1. Prove that f is constant by 
considering f’. Compare with Problem 3-20. 


A function f is Lipschitz of order a at x if there is a constant C such that 
(4) TO) — AO) Seley" 


for all y in an interval around x. The function f is Lipschitz of order ce on an 
interval if (*) holds for all x and y in the interval. 


(a) If f is Lipschitz of order a > 0 at x, then f is contmuous at x. 

(b) If f is Lipschitz of order a > 0 on an interval, then f is uniformly 
continuous on this interval (see Chapter 8, Appendix). 

(c) If f is differentiable at x, then f is Lipschitz of order | at x. Is the 
converse true? 

(d) If f is differentiable on [a,b], is f Lipschitz of order | on [a, b]? 

(ec) If f is Lipschitz of order a > 1 on [a,b], then f is constant on [a,b]. 


38. 


a9: 


40. 


41. 


43. 


44. 


45. 
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Prove that if 


then 
ag tayx ++-+++a,x" =0 


for some x in (0, 1). 


Prove that the polynomial function fi, (x) = x3 —3x +m never has two roots 
in [O, I], no matter what m may be. (This is an easy consequence of Rolle’s 
Theorem. It is mstructive, after giving an analytic proof, to graph fo and fo, 
and consider where the graph of f,, lies m relation to them.) 


Suppose that f is continuous and differentiable on [0,1], that f(x) is in 
{0, 1] for each x, and that f’(x) 4 1 for all x in [0, 1]. Show that there is 
exactly one number x in [0,1] such that f(x) = x. (Half of this problem 
has been done already, in Problem 7-1! 1.) 


~ * 9) “rR n ~ . 

(a) Prove that the function f(*) = x° — cos. satisfies f(x) = 0 for precisely 

two numbers x. 
\ . + . 9) . 
(b) Prove the same for the function f(x) = x° — xsmx — cosx. 
: . . a) . D 
*(c) Prove this also for the function f(x) = 2x“ — xsinx — cos“ x. (Some 

preliminary estimates will be useful to restrict the possible location of the 
zeros of f.) 


(a) Prove that if f is a twice differentiable function with f(0) = 0 and 
f(htaeand yO (1) = 0, then! | ("(x)| = 44for some -¢ inn (0, 1). 
In more picturesque terms: A particle which travels a unit distance in 
a unit time, and starts and ends with velocity 0, has at some time an 
acceleration > 4. Hint: Prove that either f”(x) > 4 for some x in (0, 5), 
or else f”(x) < —4 for some x in G. Ie): 

(b) Show that im fact we must have | f”(+)| > 4 for some x m (0, 1). 


Suppose that f is a function such that f’(v) = 1/x for all x > O and f(1) = 
0. Prove that f(xy) = f(x) + fQ) for all x, y > O. Hint: Find g’(x) when 
CC fey) 


Suppose that f satisfies 
f° @) + fs) — fx) = 0 


for some function g. Prove that if f is 0 at two pomts, then f is 0 on the 
mterval between them. Hint: Use Theorem 6. 


Suppose that f 1s continuous on [a,b], that it 1s n-times differentiable on 
(a, b), and that f(x) = 0 for n+1 different x in [a,b]. Prove that f“ (x) = 0 
for some x m (a, b). 
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46. 


47. 


48. 


49. 


50. 
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Lét aie ¥n41 be arbitrary points in [a, db], and let 
n+l 
O(x) = [[@ = Xz) 
n=l 


Suppose that f is (2 + 1)-times differentiable and that P is a polynomial 
function of degree < jm such that P(x;)) —w/ (form = 1)... wl (see 
Problem 3-6). Show that for each x in [a,b] there is a number c in (a, b) 
such that 
you Vic) 
x) — P(x) = O(x) -——_.. 

f(x) — P@) = O() mari 


Hint: Consider the function 


FO). = COAfO> FO) = COOLIO = ROoI. 
Show that F is zero at n + 2 different poimts in [a,b], and use Problem 45. 


Prove that 


5 < V66-—8 < f 
(without computing V66 to 2 decimal places!). 


Prove the following slight generalization of the Mean Value Theorem: If f 
is continuous and differentiable on (a,b) and lim f(y) and lim f(y) exist, 
yo b- 


yoat 


then there ts some x in (a, b) such that 


lim f(y) — lm f(y) 

Fe yob- y>at 

fa) = ——_. 

va 

(Your proof should begin: “This is a trivial consequence of the Mean Value 


Theorem because ... ”.) 


Prove that the conclusion of the Cauchy Mean Value ‘Theorem can be written 
in the form 


fb) = fla) _ f°) 
g(b) — g(a) g(x) ; 


under the additional assumptions that g(b) 4 g(a) and that f’(x) and g’(x) 


are never simultaneously 0 on (a,b). 


Prove that if f and g are continuous on [a,b] and differentiable on (a. 5), 
and g(x) 4 0 for x in (a. b), then there is some x in (a, b) with 


of OS @) 
g(x) g(b) — g(x) 


Hint: Multiply out first, to see what this really says. 


What is wrong with the following use of PHopital’s Rule: 


(The limit is actually —4.) 


52. 
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Find the following limits: 


. 5 Xx 
G) lm 
x0 tan x 
S: 
7 . cose x— | 
Gi) = lim ———— 
x>0 Ett 


Find f’(O) if 
g(x) 
16) = x 
0, =U 


andi 2(O)i= g’ (O)mes Onan (0) =a 7. 


Prove the following forms of PH6pital’s Rule (none requiring any essentially 
new reasoning). 


NTE Ce lim, g(x) = Qs and him. (a) et). ee thien 


8 ep 


lin f(x)/g(x) =/ (and similarly for limits from below). 


(Disk fe lina of Go) == lim gic) =" OF and lim ete) /2 Og) = co, then 


xXx—a 


lim f(x)/g(x) = oo (and similarly for —oo, or if x — a 1s replaced 
ae > at orx >a’). 

(c) If lm f(@) = lm g(@) = O, and lim f’(x)/g’(x) = J, then 
lim fO/g) Ls (and similarly ieee Hint: Consider 
dim f(1/x)/g 0/2). 

(d) If fim Pons = fim g(x) lL wants dim T(O)/2 @ = econhen 


lm f@)/2Gaiesoc: 
POS 
There is another form of ?H6pital’s Rule which requires more than algebraic 
manipulations: I fine fj). lun g(x) = co, and? lm f'@)/g'(x) = 7, 
X27 CO eke ene! X— 00 
then lm f(+)/g(x) =1. Prove this as follows. 
X—00 


(a) For every € > 0 there is a number a such that 


ata ites care 


Apply the Gauchy Mean Value Theorem to f and g on [a,x] to show 
that 
Lada f (@) 


—!/ 
(a = O(d) 


aie MONG cane 


(Why can we assume g(x) — g(a) 4 0?) 
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(b) Now write 
FQ) f@)—-f@ fe) g@)—8@) 
g(x) g(x) — gla) f@)— f@ 8(x) 


(why can we assume that f(x) — f(a) 4 0 for large x?) and conclude 
that 


aes) 
g(x) 


— oh 


<2e for sufficiently large x. 


‘To complete the orgy of variations on !Ho6pital’s Rule, use Problem 55 to 

prove a few more cases of the following general statement (there are so many 

possibilities that you should select just a few, if any, that interest you): 

If tun f Cai Sslnmeai = {  } and lim f’Gc)/2e’) = Ce), thenmlim 
x>[ | x>[ | x>| | x= | 

da(X)/ 30) —" (ene | || Ganebe @ OG a* or aor oO or —oo, and{ } 

can be 0 or 00 or —o0, and (__) can be / or o0 or —oo. 


If f and g are differentiable and lim f(x)/g(x) exists, does it follow that 
Xa 


lin f'(x)/g'(x) exists (a converse to FHo6pital’s Rule)? 
Xt 


Prove that if f’ is increasing, then every tangent line of f intersects the graph 
of f only once. (In particular, this is true for the function f(x) = x" if nis 
even.) 


Redo Problem 10-18 (c) when 


op == 
a 


(Why is this problem ts this chapter?) 


(a) Suppose that f 1s differentiable on [a,b]. Prove that if the mmimum 
of f on [a, b] is at a, then f’(a) = 0, and if it is at b, then f’(b) < 0. 
(One half of the proof of Theorem 1 will go through.) 

(b) Suppose that f’(a) < 0 and f’(b) > 0. Show that f’(x) = 0 for some x 
in (a,b). Hint: Consider the minimum of f on [a,b]; why must it be 
somewhere in (a, b)? 

(c)" Prowe thatit- (aye c= f’(b), then (x) —etorsomex in (a5). ( Mhis 
result is known as Darboux’s Theorem. Note that we are noé assuming 
that f’ is continuous.) Hint: Cook up an appropriate function to which 
part (b) may be apphed. 


Suppose that f 1s differentiable in some mterval containing a, but that f” is 
discontinuous at a. Prove the following: 


(a) ‘The one-sided limits lin f’(x) and lim f’(x) cannot botli exist. (This 


te Ca roa 
is just a minor variation ou ‘Vheorem 7.) 
(b) ‘Vhese one-sided limits caunot both exist even if we allow linits with the 
value +00 or —oo. Hint: Use Darboux’s ‘Theorem (Problem 60). 


a02: 
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It is easy to find a function f such that |f| is differentiable but f is not. 
For example, we can choose f(x) = | for x rational and f(x) = —1 for 
x irrational. In this example f is not even continuous, nor is this a mere 
coincidence: Prove that if | f| 1s differentiable at a, and f is continuous at a, 
then f is also differentiable at a. Hint: It suffices to consider only a with 
f(a) =0. Why? In this case, what must | f|’(@) be? 


(a) Letey 4 O and let *be sven. Prove that x” =+- y° = (-—F y)" only 
whenky =O) bint Ifo == y" = (Go + y) > apply Kolles Theorem to 
fQ@) =x" + y" —(«+ y)" on [0, xo}. 

(b) Prove that if y 40 and n is odd, then x” + y” = (x + y)” only if x = 0 
or x = —y. 


Suppose that f(0) = 0 and f’ is increasing. Prove that the function g(x) = 
f (x)/x is increasing on (0, 00). Hint: Obviously you should look at g’(x). 
Prove that it is positive by applying the Mean Value Theorem to f on the 
right interval (it will help to remember that the hypothesis f(0) = 0 is essen- 
tial, as shown by the function f(x) = 1+ x2). 
Use derivatives to prove that if n > 1, then 

Clea) > 1 7m for —l <x <Omnds0 < x 
(notice that equality holds for x = 0). 


Let f(x) =xtsin? 1/x for x #0, and let f(0) = 0 (Figure 33). 


(a) Prove that 0 is a local minum point for f. 


(b)” Proveathatyf (Oh=mfeatO) = 0. 


This function thus provides another example to show that Theorem 6 cannot 
be improved. It also illustrates a subtlety about maxima and minima that 
often goes unnoticed: a function may not be increasing in any interval to the 
right of a local minimum point, nor decreasing in any interval to the left. 


TOUS OST, B33 


(a) Prove that if f(a) > O and f’ 1s continuous at a, then f is increasing in 
some interval containing a. 


The next two parts of this problem show that continuity of f’ is essential. 


(b) If g(x) = x7 sin 1/x, show that there are numbers x arbitrarily close to O 
with g(x) = | and also with g(x) = —-1. 
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(c)~Suppose O® a < al. Let* f@) = ax + x? sin 1/x for x 4 0, and let 
f(O) = O (see Figure 34). Show that f is not increasing in any open 
interval containing 0, by showing that in any interval there are points x 
with f’(x) > 0 and also points x with f’(x) < 0. 


FIGURE 34 


The behavior of f for a > 1, which is much more difficult to analyze, 1s 
discussed in the next problem. 


#68. Let f(x) = ax +x7sin1/x for x 4 0, and let f(0) = 0. In order to find 
the sign of f’(x) when a@ > | it is necessary to decide if 2x sin I /x — cos 1 /x 
is < —1 for any numbers x close to 0. It is a little more convenient to 
consider the funetion g(y) = 2(sin y)/y — cos y for y 4 0; we want to know 
if g(v) < —1 forlarge y. This question is quite delicate; the most significant 
part of g(y) is — cos y, which does reach the value —1, but this happens only 
when sin y = Q, and it is not at all clear whether g itself can have values 
< ~—]. The obvious approach to this problem is to find the local mimimum 
values of g. Unfortunately, it is impossible to solve the equation g/(y) = 0 
expheitly, so more ingenuity 1s required. 


(a) Show that if g’(y) = 0, then 


2-—y° 
a : 


~ 


cos y = (sin y) 
and eonclude that 


2(*) = (sity) 


7769, 
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(b) Now show that if g’(y) = 0, then 


Ay? 


8 
tS — ——— . 
- 4+ y4 


and conclude that 
2+ ° 


e@)| = =. 
yA 


(c) Using the fact that (2 + y?)/V 4+ y* > 1, show that if aw = 1, then f is 
not mcreasing in any interval around 0. 


‘ ee rs 3 > ‘ 
(d) Using the fact that lim (2 + y?)/V4 + yt = 1, show that if a > 1, then 
yoo 
f is increasing in some interval around 0. 
A function f is increasing at a if there is some number 6 > 0 such that 


T (Gece alG ells 0 <a co Ga 
and ? 

TOy= fla Ww a6 =X -<m@ 
Notice that this does not mean that f is increasing in the interval (a — 6, 
a +4); for example, the function shown in Figure 34 is increasing at 0, but 
is Not an increasing function m any open mterval containing 0. 


(a) Suppose that f 1s continuous on [0,1] and that f is increasing at a for 
every a in [0, 1]. Prove that f is mcreasing on [0, 1]. (First convince 
yourself that there is something to be proved.) Hint: For 0 < b < 1, 
prove that the minimum of f on [b, 1] must be at b. 

(b) Prove part (a) without the assumption that f 1s contmuous, by consider- 
ing for each b in [0, 1] the set S, = {x : f(y) = f() for all y in [b, x]}. 
(This part of the problem is not necessary for the other parts.) Hint: 
Prove that Sp = {x : b < x < 1} by considering sup Sy). 

(c) If f is increasing at a and f is differentiable at a, prove that f’(a) > 0 
(this is easy). 

(d) If f’(a) > 0, prove that f is increasing at a (go right back to the definition 
of f’(a)). 

(ec) Use parts (a) and (d) to show, without using the Mean Value Theorem, 
that if f is continuous on [0, 1] and f’(a) > 0 for all a m [0. 1], then f 
is mcreasing on [0, 1]. 

(f) Suppose that f is continuous on [0,1] and f’(a) = 0 for all a in (0, 1). 
Apply part (e) to the function g(x) = f(x) + ex to show that f(1) — 
f (0) > —e. Similarly, show that f(1) — f(0) < e by considering h(x) = 
ex — f(x). Conclude that f(O) = fl). 


This particular proof that a function with zero derivative must be constant has 


many points in common with a proof of H. A. Schwarz, which may be the 
first rigorous proof ever given. Its discoverer, at least, seemed to think it was. 
See his exuberant letter in reference [54] of the Suggested Reading, 
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(a) 


(a) 


If f is a constant function, then every point is a local maximum pomt 
for f. It is quite possible for this to happen even if f is not a constant 
funation: for example, f f(x) ="Oifem %e—<e0) and f(4) = lstorm = 
0. But prove, using Problem 8-4, that if f is contimuous on [a,b] and 
every point of [a,b] is a local maximum point, then f is a constant 
function. The same result holds, of course, if every point of [a,b] is a 
local minimum point. 

Suppose now that every point is either a local maximum or a local mini- 
mum point for the continuous function f (but we don’t preclude the pos- 
sibility that some points are local maxima while others are local minima). 
Prove that f is constant, as follows. Suppose that f(ag) < f(bo). We 
can assume that f(ag) < f(x) < f(bo) for ag < x < bo. (Why?) Using 
Theorem | of the Appendix to Chapter 8, partition [ag, bg] mto intervals 
on which sup f — inf f < (f(b9) — f(ao))/2; also choose the lengths of 
these intervals to be less than (bg —ag)/2. ‘Then there is one such interval 
[a;, by] with ag < ay < by < bo and f(a;) < f(b,). (Why?) Continue 
inductively and use the Nested Interval Theorem (Problem 8-14) to find 
a point x that cannot be a local maximum or minimum. 


A point x is called a strict maximum point for f on Aif f(x) > f(y) 
for all y in A with y # x (compare with the definition of an ordinary 
maximum point). A local strict maximum point is defined in the 
obvious way. Find all local strict maximum poits of the function 


0, x irrational 
fay=4 ! P. 

—, x =-— m lowest terms. 

q q 
It seems quite unhkely that a function can have a local strict maximum 
at every point (although the above example might give one pause for 
thought). Prove this as follows. 
Suppose that every pomt is a local strict maximum point for f. Let 
x; be any number and choose a; < x; < by; with b; — ay < | such 
that f(x) see ator all x im i[a), 0). alet 1. A~egebe any pom 
(a,, by) and choose ay < az < x2 < bz < by with bz — a2 < § such that 
f (x2) > f(x) for all x in [a2,b2]. Continue m this way, and use the 
Nested Interval Theorem (Problem 8-14) to obtain a contradiction. 


FIGURE | 


(a, f(a)) 


DEFINITION 1 


(b, f(b) 


DEFINITION 2 
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APPENDIXG PCONVEXITY AND CONCAVITY, 


Although the graph of a function can be sketched quite accurately on the basis 
of the information provided by the derivative, some subtle aspects of the graph are 
revealed only by examining the second derivative. These details were purposely 
omitted previously because graph sketching is complicated enough without wor- 
rying about them, and the additional information obtained is often not worth the 
effort. Also, correct proofs of the relevant facts are sufficiently difficult to be placed 
in an appendix. Despite these discouraging remarks, the information presented 
here is well worth assimilating, because the notions of convexity and concavity are 
far more important than as mere aids to graph sketching. Moreover, the proofs 
have a pleasantly geometric flavor not often found in calculus theorems. Indeed, 
the basic definition is geometric in nature (see Figure 1). 


A function f is convex on an interval, if for all a and b in the interval, the line 


segment joming (a, f(a)) and (b, f(b)) hes above the graph of f. 


The geometric condition appearing in this definition can be expressed in an 
analytic way that is sometimes more useful in proofs. ‘The straight hne between 


(a, f(a)) and (b, f(b)) is the graph of the function g defined by 


_ Agia, 
a a 


(x) ae 


x —a)+ f(a). 


This line hes above the graph of f at x if g(x) > f(x), that 1s, if 


na). f(a) aha) 
b-a 
Or 
b — 
LS I) —a) > f(x)— f(a) 
b—a 
or 


LOSS Je) I@ 


b—a x-—a 


We therefore have an equivalent definition of convexity. 


A function f is convex on an interval if for a, x, and b in the interval with 
a <x <bwe have 


TO Ee a nies ACE) 


x-—a b—a 
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(a, f(a)) 


Le Gy eRe E ee 


(b, f (b)) 


THEOREM 1 


PROOF 


If the word “over” in Definition 1 is replaced by “under” or, equivalently, if the 
inequality in Definition 2 is replaced by 


Lehto) atl) RO) te AE 


Xx—-—a b—-—a 


we obtain the definition of a concave function (Figure 2). It is not hard to see that 
the concave functions are precisely the ones of the form —f, where f is convex. 
For this reason, the next three theorems about convex functions have immediate 
corollaries about concave functions, so simple that we will not even bother to 
state them. 

Figure 3 shows some tangent lines of a convex function. Two things seem to be 
(We: 


(1) ‘The graph of f hes above the tangent line at (a, f(a)) except at the point 
(a, f(a)) itself (this point is called the point of contact of the tangent Iie). 

(2) If a < b, then the slope of the tangent line at (a, f(a)) is less than the slope 
of the tangent line at (b, f(b)); that ts, f’ is increasing. 


As a matter of fact these observations are true, and the proofs are not difficult. 


FIGURE 3 


Let f be convex. If f is differentiable at a, then the graph of f lies above 
the tangent line through (a, f(a)), except at (a, f(a)) itself. If a < b and f is 
differentiable at a and b, then f’(a) < f'(b). 


If 0 < hy < My, then as Figure 4 indicates, 


(1) fia +h) — f(a) x Jia-+ ta) — ja) 

hy ho , 
A nonpictorial proof can be derived immediately from Definition 2 applied to 
a<a+h, <a+hy. Inequality (1) shows that the values of 


f(a th) — fla) 
h 
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ath, a+h) 


FIGURE 4 


decrease as h > 0*. Consequently, 


fla+h) — f(a) 
h 
(in fact f’(a) is the greatest lower bound of all these numbers). But this means that 
for h > 0 the secant line through (a, f(a)) and (a +h, f(a +h)) has larger slope 
than the tangent line, which implies that (a +h, f(a +h)) lies above the tangent 
line (an analytic translation of this argument is easily supplied). 
For negative fA there is a similar situation (Figure 5): if hz < hy < 0, then 
fla+h,)— fla) f(a+ho2)— fla) 
hy Zi h 


f(a) < forh > 0 


This shows that the slope of the tangent line is greater than 
fla+h)— f(a) 
h 


(in fact f’(a) is the least upper bound of all these numbers), so that f(a + h) lies 
above the tangent line if h < 0. This proves the first part of the theorem. 


forh <0 


ath, a+t+h, 


FIGURE § 
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ee 
ax, Xo b 
FIGURE 7 
LEMMA 
PROOF 


FIGURE 6 


Now suppose that a < b. Then, as we have already seen (Figure 6), 


fla +(b—a)) — fla) 


f(a) < since b—a > 0 
b—a 
ge f(b) — fla) 
~ b-—a 
and 
b —b))— fil 
f(b) > Lie Sse 8 since a—b < 0 
a—hb 
_ fla)— f(b) _ flb)— fl@ 
S a—b ‘ b-—a 


Combining these inequalities, we obtain f’(a) < f’(b). | 


Theorem | has two converses. Here the proofs will be a little more difficult. 
We begin with a lemma that plays the same role in the next theorem that Rolle’s 
Theorem plays in the proof of the Mean Value Theorem. It states that iff’ 
is increasing, then the graph of f lies below any secant line which happens to be 
horizontal. 


Suppose f is differentiable and f’ is increasing. If a < b and f(a) = f(b), then 
f(x) < fla) = f(b) fora < x <b. 


Suppose that f(x) > f(a) = f(b) for some x in (a,b). Then the maximum of 
f on [a,b] occurs at some point xp in (a,b) with f(xo) > f(a) and, of course, 
f'(xo) = 0 (Figure 7). On the other hand, applying the Mean Value ‘Theorem to 
the interval [a,xo|, we find that there is x; with a < x, < x9 and 


_ f(xo) — fla) s 
xo —a ee 


f(xy) 0), 


contradicting the fact that f’ is increasing. J 


FIGURE 8 


THEOREM 2 


PROOF 


THEOREM 3 


PROOF 
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We now attack the general case by the same sort of algebraic machinations that 
we used in the proof of the Mean Value ‘Theorem. 


If f is differentiable and f’ is increasing, then f is convex. 


Let a. <b "Denne = by 


Ip) = 
gee = ff) — TB, =u). 


b=a 


It is easy to see that g’ 1s also increasing; moreover, g(a) = g(b) = f(a). Applying 
the lemma to g we conclude that 


Ae eo (Ch Mee Nene). 


In other words, if a < x < b, then 


tO) oy 


f(x)- x—a) < f(a) 
b-—a 
or 
Oo aia@) u f(b) — fla) 
x-a b-—a 


Hence f is convex. J 


If f is differentiable and the graph of f lies above each tangent line except at the 
point of contact, then f is convex. 


Let a <b. It is clear from Figure 8 that if (b, f(b)) les above the tangent line at 
(a, f(a)), and (a, f(a)) les above the tangent line at (b, f(b)), then the slope of 
the tangent line at (b, f(b)) must be larger than the slope of the tangent line at 
(a, f(a)). The following argument just says this with equations. 

Since the tangent line at (a, f(a)) is the graph of the function 


g(x) = f'(a(x —a) 4+ fla), 
and since (b, f(b)) lies above the tangent line, we have 
DO yf (ab — a) + fia). 
Similarly, since the tangent line at (b, f(b)) is the graph of 
h(x) = f'(b)(x —b) + f(b), 
and (a. f(a)) hes above the tangent line at (b, f(b)), we have 
(2) fla)> f'(b\(a—b)+ fb). 
It follows from (1) and (2) that f’(a) < f’(b). 


It now follows from Theorem 2 that f is convex. J 


Ifa function f has a reasonable second derivative, the mformation given in these 
theorems can be used to discover the regions in which f is convex or coucave. 
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Consider, for example, the function 


ice = 5: 


1+x 


For this function, 
—2x 


1 axe 
Thus f’(x) = 0 only for x = 0, and f(0) = 1, while 


FOUISO it LeSo: 
foes wf xo: 


Moreover, 


F(x):= 0 forall x; 
f(x) —~ 0 asx > wor -o~, 
f is even. 


FIGURE 9 


The graph of f therefore looks something like Figure 9. We now compute 


(1 +x7)?(—2) + 2x - [2(1 +x?) - 2x] 


Lae (1 +x?) 


~ 2(3x2— 1) 
7 4+ e2)2- 


It is not hard to determine the sign of f”(x). Note first that f”(x) = 0 only when 
x = V1/3 or —/1/3. Since f” is clearly continuous, it must keep the same sign 


on each of the sets 


(—oo, -—/ 1/3), 
(—J/ 1/3, V1/3), 


(V 1/3, 00). 
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Since we easily compute, for example, that 


1) — 5>0, 
f'() = —2 < 0, 
ie OH) oS (0 


we conclude that 


f’ > 0 on (—o00, —,/1/3) and (/1/3, 00), 
pf = Ofont(—./ 1739/1773"): 


Since f” > 0 means f’ is increasing, it follows from Theorem 2 that f is convex on 


(—oo, —V 1/3) and (JV 1/3, 00), while on (— 1/3, V 1/3) f is concave (Figure 10). 


f is convex f is concave f is convex 


vin | iB 


FIGURE 10 


Notice that at (/ 1/3, 3) the tangent line les below the part of the graph to the 


right, since f is convex on (1/3, 00), and above the part of the graph to the left, 
since f is concave on (—/ 1/3, V 1/3); thus the tangent line crosses the graph. In 


general, a number a 1s called an inflection point of / if the tangent line to the 


graph of f at (a, f(a)) crosses the graph; thus 173 and a are inflection 
points of f(x) = 1/(1 + x7). Note that the condition f”(a) = 0 does not ensure 
that a is an inflection point of f; for example, if f(x) = x*, then f”(0) = 0, but 
f is convex, so the tangent line at (0, 0) certainly doesn’t cross the graph of f. In 
order to conclude that a is an inflection point of a function f/, we need to know 
that f” has different signs to the left and right of a. 


This example illustrates the procedure which may be used to analyze any func- 
tion f. After the graph has been sketched, using the information provided by /’, 
the zeros of f” are computed and the sign of f” is determined on the intervals 
between consecutive zeros. On intervals where f” > O the function is convex; 
on intervals where f” < 0 the function is concave. Knowledge of the regions of 
convexity and concavity of f can often prevent absurd misinterpretation of other 
data about f. Several functions, which can be analyzed in this way, are given in 
the problems, which also contain some theoretical questions. 
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‘Lo round out our discussion of convexity and concavity, we will prove one further 
result that you may already have begun to suspect. We have seen that convex and 
concave functions have the property that every tangent hne intersects the graph 
Just once; a few drawings will probably convince you that no other functions have 
this property, but the only proof I know is rather tricky. 


THEOREM 4 If / is differentiable on an interval and intersects each of its tangent lines just 
once, then f is either convex or concave on that interval. 


PROOF ‘There are two parts to the proof. 
(1) First we claim that no straight line can intersect the graph of f in three different 
flo) points. Suppose, on the contrary, that some straight ne did intersect the graph 
of f at (a, f(a)), (6, f(b)) and (c, f(c)), with a < b < c (Figure 11). Then we 
f(b) would have . 
(1) SO) mae Oe i) 
HAD b—a c-a 
Consider the function 
i t 
(x) — f(a) Re mE 
_ a En) = Jt) = Ka) {ona im |>ye. 
x-a 
Equation (1) says that g(b) = g(c). So by Rolle’s Theorem, there is some number x 
FIGURE 1] in (b,c) where 0 = g’(x), and thus 


0=-a2 say Ga) 


or 
m slap alam ia Ge) 


X= 4 


Nhe) 


But this says (Figure 12) that the tangent hne at (x, f(¥)) passes through (a, f(@)), 
contradicting the hypotheses. 


(2) Suppose that ag < bg < cg and ay < by < cy are pomts in the interval. Let 


xX, = (1 —ft)ag + tay 
esl — eager) 0 =<%, =a1r 
al = 2a Ewe) 


PIP BORIS 12 


Then x9 = a9 and x; = a; and (Problem 4-2) the points x; all he between ap 
and a;, with analogous statements for y, and z;. Moreover, 


ee <M, <7, for 0 =e _ Ie 


Now consider the function 

fod- £6)  fedn— Fd 
Wy — XY bf gta 

By step (1), ¢(@) 4-0 for all nim (0,1). So either g(t) > O for all ¢ m [0, 1] or 

g(t) < 0 for all 7 in [0, 1]. Thus, either f is convex or f is concave. J 


(= for 0 =": 
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PROBLEMS 


IN 


*6. 


re 


Sketch, indicating regions of convexity and concavity aud points of inflection, 
the functions in Problem | 1-1 (consider (iv) as double starred). 


Figure 30 im Chapter 11 shows the graph of f’. Sketch the graph of /f. 


Show that f is convex on an interval if and only if for all x and y in the 
interval we have 


f@x+(l—fty) <tf@)+d-af(y). for0<t <1. 
(This is just a restatement of the definition, but a useful one.) 


(a) Prove that if f and g are convex and f 1s increasing, then fog is convex. 
(It will be easiest to use Problem 3.) 

(b) Give an example where g o f is not convex. 

(c) Suppose that f and g are twice differentiable. Give another proof of the 
result of part (a) by considering second derivatives. 


(a) Suppose that f is differentiable and convex on an interval. Show that 
either f is increasing, or else f is decreasing, or else there is a number c 
such that f is decreasing to the left of ¢ and increasing to the right of c. 

(b) Use this fact to give another proof of the result in Problem 4(a) when f 
and g are (one-time) differentiable. (You will have to be a little careful 
when comparing f’(g(x)) and f’(g()) for x < y.) 

(c) Prove the result mm part (a) without assuming f differentiable. You will 
have to keep track of several different cases, but no particularly clever 
ideas are needed. Begin by showing that if a < b and f(a) < f(b), then 
f is increasing to the right of b; and if f(a) > f(b), then f is decreasing 
to the left of a. 


Let f be a twice-differentiable function with the following properties: 
fae oO Nore Oe and fms idecreasing, ands /(0)c— 0 Prose that 
f(x) = 0 for some x > 0 (so that in reasonable cases f will have an inflec- 
tion point at x —an example is given by f(x) = 1/(1+x7)). Every hypothesis 
in this theorem is essential, as shown by f(x) = 1 — x7, which is not positive 
lOmaleweays Cx) = x?, which is not decreasing; and by" f (x) =_1/ Oe l4; 
which does not satisfy f’(0) = 0. Hint: Choose x9 > 0 with f’(x9) < 0. We 
cannot have f’(y) < f’(xo) for all y > xo. Why not? So f’(x1) > f’(xo) for 
some x; > x9. Consider f’ on [0, x1]. 


(a) Prove that if f is convex, then f([« + y]/2) < (f@) fQ)]/2. 

(b) Suppose that f satisfies this condition. Show that f(kx + (i —k)y) < 
kf (x) + (1 —k) f(y) whenever & is a rational number, between 0 and 1, 
of the form m/2”. Hint: Part (a) is the special case n = 1. Use induction, 
employing part (a) at each step. 

(c) Suppose that f satisfies the condition im part (a) and f is continuous. 
Show that f 1s convex. 
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H 
*8.  Forn > 1, let py,..., px be positive numbers with 3 Die 


i=] 
n 


(a) For any numbers x), ...,X, show that ys pix; hes between the smallest 


and the largest x;. i=l 
n—l n—l 
(b) Show the same for (1/1) SS Deiwherer = yy Di. 
pl i=! 
n H 
(c) Prove Jensen’s inequality: If f is convex, then f py psi) < yy Did Ode 
Hint: Use Problem 3, noting that p, = 1—t. fan (b) is Aeatiad to show 
n—l 
that (1/t) »E Pees in the domainvol 7 if x).4- 4, x, ane. 
i=l 


*9, (a) Forany function f, the right-hand derivative, him [f(a+h)— fla )]/ 1mis 


denoted by f(a), and the left-hand derivative is denoted by f’(a). The 
proof of Theorem | actually shows that f{ (a) and f! (a) always exist if 
f is convex on some open interval containing a. Check this assertion, 
and also show that f; and f! are increasing, and that f’(a) < f{ (a). 

(b) Conversely, suppose that f is convex on [a, b] and g is convex on [b,c], 
with f(b) = g(b) and f'(b) < g'(b) (Figure 13(a)). If we define h 
on [a,c] to be f on [a,b] and g on [b,c], show that h is convex on 

g [a,c]. Hint: Given P and Q on opposite sides of O = (b, f(b)), as in 
Figure 13(b), compare the slope of OQ with that of PO. 

(c) Show that if f is convex, then fj (a) = f/(@) if and only if f is con- 
tinuous at a. (Thus f is differentiable precisely when f{ 1s continuous.) 
Hint: [f (6) — f(a)]/(b — a) is close to f’(a) for b < a close to a, and 
f(b) is less than this quotient. 


a b c *10. (a) Prove that a convex function on R, or on any open interval, must be 
continuous. 

(a) (b) Give an example of a convex function on a closed interval that is not 

continuous, and explain exactly what kinds of discontinuities are possible. 


11. Calla function f weakly convex on an interval if for a < b < c mm this interval 
we have 
fO)-f@ _ f)—f@ 


x—a a b—-—a 


(a) Show that a weakly convex function is convex if and only if its graph 
———k oe contains no straight line segments. (Sometimes a weakly convex function 
g b G is simply called “convex,” while convex functions in our sense are called 


(b) “strictly convex”) 
FIGURE 13 (b) Reformulate the theorems of this section for weakly convex functions. 


12. 


13. 
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Find two convex functions f and g such that f(x) = g(x) if and only if x 
is an integer. Hint: First find an example where g is merely weakly convex, 
and then modify it, using the result of Problem 9 as a guide. 


A set A of points in the plane is called convex if A contains the line segment 


joining any two points in it (Figure 14). For a function f, let Ay be the set 


of points (x, y) with y > f(x), so that A, is the set of points on or above 
the graph of f. Show that Ay is convex if and only if f is weakly convex, 
in the terminology of the previous problem. Further information on convex 
sets will be found in reference [18] of the Suggested Reading, 


(a) a convex subset of the plane (b) a non-convex subset of the plane 


FIGURE 14 


CHAPTER 


DEFINITION 


INVERSE FUNCTIONS 


We now have at our disposal quite powerful methods for investigating functions; 
what we lack is an adequate supply of functions to which these methods may 
be applied. We have studied various ways of forming new functions from old— 
addition, multiphcation, division, and composition-—but using these alone, we can 
produce only the rational functions (even the sine function, although frequently 
used for examples, has never been defined). In the next few chapters we will 
begin to construct new functions in quite sophisticated ways, but there is one 
important method which will practically double the usefulness of any other method 
we discover. 

If we recall that a function is a collection of pairs of numbers, we might hit upon 
the bright idea of simply reversing all the pairs. Thus from the function 


f={( 2) handle leno, 


we obtain 


g={Caly, Gyre! Syris als) | 
While @Qpr= Zand: 43) = 4, weshaveso(2)e=eleand a4) — 3, 


Unfortunately, this bright idea does not always work. If 
f =1 (1,2), G4 eS), eae 


then the collection 


1 (2, 1), Gash Oxs) eGalan} 


is not a function at all, since it contams both (4, 3) and (4, 13). It is clear where 
the trouble hes: f(3) = f(13), even though 3 4 13. This is the only sort of thing 
that can go wrong, and it is worthwhile giving a name to the functions for which 
this does not happen. 


A function f 1s one-one (read “one-to-one”) if f(a) # f(b) whenever a ¥ b. 


The identity function / is obviously one-one, and so is the followmg modifica- 


tion: 
x; OPS 
ey) = (Say 
Sm hea 


The function f(x) = x? is not one-one, since f(—1) = f(1), but if we define 
4 
ek) — es ners 
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(and leave g undefined for x < Q), then g is one-one, because g is increasing (since 
g'(x) = 2x > 0, for x > 0). This observation is easily generalized: If n is a natural 
number and 


i=, x= Oo 
then f is one-one. If n is odd, one can do better: the function 


Po — x" for all x 


is one-one (since f’(x) = nx"! > 0, for all x 4 0). 

It is particularly easy to decide from the graph of f whether f is one-one: the 
condition f(a) # f(b) for a 4 b means that no horizontal line intersects the graph 
of f twice (Figure 1). 


a one-one function a function that is not one-one 


(a) (b) 


RUG Ui Reet 


If we reverse all the pairs in (a not necessarily one-one function) f we obtain, in 
any case, some collection of pairs. It is popular to abstain from this procedure un- 
less f is one-one, but there is no particular reason to do so— instead of a definition 
with restrictive conditions we obtain a definition and a theorem. 


For any function f, the inverse of f, denoted by f~}, is the set of all pairs 


(a, b) for which the pair (b, a) is in f. 


f—' is a function if and only if f is one-one. 


Suppose first that f is one-one. Let (a,b) and (a,c) be two pairs in ok Then 
(b,a) and (emq@) are in f, so @ = f(b) and a,.= f(c); since f is ont-one this 
imphes that b= c. Thus f7! is a function. 

Conversely, suppose that a’ is adunction Wt Fb) 7 (ey ahen f contains 
the pairs (b, f(b)) and (c, f(c)) = (c, f(b)), so (f(b), b) and (f(b), c) are in me 


Since f~! is a function this imphes that b = c. Thus f is one-one. ff 
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The graphs of f and f~! are so closely related that it is possible to use the 
graph of f to visualize the graph of f~!. Since the graph of f~! consists of all 
pairs (a, b) with (b, a) in the graph of f, one obtains the graph of f~! from the 
graph of f by interchanging the horizontal and vertical axes. If f has the graph 
shown in Figure 2(a), 
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This procedure is awkward with books and impossible with blackboards, so it is 
fortunate that there is another way of constructing the graph of f~'. The points 
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(a,b) and (b,a) are reflections of each other through the graph of /(x) = x, 
which is called the diagonal (Figure 4). To obtain the graph of f~! we merely 
reflect the graph of f through this line (Figure 5). 


(a, b) 

e Z 

wih 
_/diagonal 
(ad) 
e me 

ao : e(b, a) 

wae *(d.c) 


FIGURE 4 FG UiRaeS 


Reflecting through the diagonal twice will clearly leave us right back where we 
started; this means that (f~!)~! = f, which is also clear from the definition. In 
conjunction with Theorem 1, this equation has a significant consequence: if f 
is a one-one function, then the function see is also one-one (since (f Ls liiabe tars 
function). 

There are a few other simple manipulations with inverse functions of which you 
should be aware. Since (a,b) 1s in f precisely when (b, a) 1s in f —! it follows that 


b= f(a) means the same as a= oD): 


Thus f7~!(b) is the (unique) number a such that f(a) = b; for example, if f(x) = 
x3, then f~!(b) is the unique number a such that a? = b, and this number is, by 


definition, VB. 


The fact that f~!(x) is the number y such that f(y) = x can be restated in a 
much more compact form: 


f(f tx) =x, for all x in the domain of f7!. 


Moreover, 


fore lafiloe) axe for all x in the domam of /f; 


this follows from the previous equation upon replacing f by f~!. These two 
important equations can be written 


Poi a dk 
fvlof=l 
(except that the right side will have a bigger domain if the domain of f or f~! is 


not all of R). 
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FIGURE 6 


PG Wikekiey 


THEOREM 2 


PROOF 


Since many standard functions will be defined as the inverses of other functions, 
it is quite mportant that we be able to tell which functions are one-one. We have 
already hinted which class of functions are most easily dealt with—increasing and 
decreasing functions are obviously one-one. Moreover, if f is mcreasing, then f—! 
is also increasing, and if f is decreasing, then f~! is decreasing (the proof is lefi 
to you). In addition, f is increasing if and only if —/f is decreasing, a very useful 
fact to remember. 

It is certainly not true that every one-one function is either creasing or decreas- 
ing. One example has already been mentioned, and is now graphed in Figure 6: 


x, x 33,5 
(ep=a oe a =D 
je) 


Figure 7 shows that there are even continuous one-one functions which are neither 
mcreasmg nor decreasing. But if you try drawing a few pictures you will soon 
suspect that every one-one continuous function defined on an interval is either 
increasing or decreasing. 


If f is continuous and one-one on an interval, then f is either mcreasing or 
decreasing on that interval. 


‘The proof proceeds in three easy steps: 


(1) If a < b <c are three pomts im the interval, then 


either (i) (ay <a} (bye fe) 
or (11) f(a = f(b) = Flo): 


Suppose, for example, that f(a) < f(c). If we had f(b) < f(a) (Figure 8), then 
the Intermediate Value Theorem applied to the interval [b, c] would give an x with 


FGA 


b<x <cand f(x) = f(@), contradicting the fact that f is one-one on [a.c]. 
Sunilarly, f(b) > f(c) would lead to a contradiction, so f(a) < f(b) < f(c). 


Naturally, the same sort of argument works for the case f(a) > f(c). 


(2) If a<b<c <d are four pomts in the interval, then 


either (1) fla) <= f(b) = fe) = fd) 
or (ii) f(a) > fo) = f@)= [@. 


For we can apply (1) to a <b < ¢ and then to b <¢ <d. 


FIGURE 9 


FIGURE 10 


(& 


THEOREM 3 


PROOF 
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(3) ‘Take any a < b in the interval, and suppose that f(a) < f(b). Then f is 
increasing: For if c and d are any two points, we can apply (2) to the collection 
{a, b,c, d} (after arranging them in increasing order). J 


Henceforth we shall be concerned almost exclusively with continuous increasing 
or decreasing functions which are defined on an interval. If f is such a function, 
it is possible to say quite precisely what the domain of f7! will be like. 

Suppose first that f is a continuous increasing function on the closed interval 
[a,b]. Then, by the Intermediate Value Theorem, f takes on every value between 
f(a) and f(b). Therefore, the domain of f7! is the closed interval [ f(a), f(b)]. 
Similarly, if f is continuous and decreasing on [a,b], then the domain of f~! is 
[f(d). fla)]. 

If f is a continuous increasing function on an open interval (a,b) the analysis 
becomes a bit more difficult. To begin with, let us choose some point c in (a, db). 
We will first decide which values > f(c) are taken on by f. One possibility is that 
f takes on arbitrarily large values (Figure 9). In this case f takes on all values 
> fic), by the Intermediate Value Theorem. If, on the other hand, f does not 
take on arbitrarily large values, then A = { f(x) : c < x < b} is bounded above, 
so A has a least upper bound @ (Figure 10). Now suppose y is any number with 
f(c) < y < a. Then f takes on some value f(x) > y (because @ is the least 
upper bound of A). By the Intermediate Value Theorem, f actually takes on 
the value y. Notice that f cannot take on the value a itself; for if a = f(x) for 
a <x < band we choose ¢t with x <t < b, then f(t) > a, which is impossible. 

Precisely the same arguments work for values less than f(c): either f takes on 
all values less than f(c) or there is a number B < f(c) such that f takes on all 
values between B and f(c), but not £ itself: 

This entire argument can be repeated if f is decreasing, and if the domain of f 
is R or (a,oo) or (—o0, a). Summarizing: if f is a continuous increasing, or 
decreasing, function whose domain is an interval having one of the forms (a,b), 
(—oo, b), (a, 00), or R, then the domain of fu is also an interval which has one 
of these four forms, and we can easily fit the remaining types of intervals, (a, b], 
[a,b], (—cv, b], and [a, oo), into this discussion also. 

Now that we have completed this preliminary analysis of continuous one-one 
functions, it is possible to begin asking which important properties of a one-one 
function are inherited by its inverse. For continuity there is no problem. 


If f is continuous and one-one on an interval, then f~! is also continuous. 


We know by Theorem 2 that f is either increasing or decreasing. We might as 
well assume that f is increasing, since we can then take care of the other case by 
applying the usual trick of considering — f. We might as well assume our interval 
is Open, since it is easy to see that a continuous increasing or decreasing function 
on any interval can be extended to one on a larger open interval. 

We must show that 


lim f-'(@x) = f-'(b) 


xb 
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f(a)+6 


flaj=sb 


ERG ReE a Ie 


(f(a),a) ZA, 
Ux 2 


(a, f(a)) 


FIGURE 12 


for each b in the domain of f~!. Such a number db is of the form f(a) for some a 
in the domain of f. For any ¢ > 0, we want to find a 6 > 0 such that, for all x, 


if f(a) -—5 <x < f(a)+6, thena—e < mul) <at+é. 
Figure 11 suggests the way of finding 6 (remember that by looking sideways you 
see the graph of f~!): since 
(OIE a GRETA Sa ose 
it follows that 
f(a—e) < f(a) < flat+e); 


we let 6 be the smaller of f(a +e)— f(a) and f(a)— f(a—e). Figure 11 contains 
the entire proof that this 6 works, and what follows is simply a verbal account of 
the information contained in this picture. 

Our choice of 5 ensures that 


f(a—e) < f(a) —d and f(a) +6 < flat+e). 


Consequently, if 


f(a)-—5 <x < f(a)t+64, 


then 


fla—e)<x < f(a+é). 
Since f is increasing, f~! is also increasing, and we obtain 
f'(fla-e8)) < f(x) < f- (fa +e)). 


1.€., 
a-é< Tae <a+teé, 


which is precisely what we want. J 


Having successfully investigated continuity of f~!, it is only reasonable to tackle 
differentiability. Again, a picture indicates just what result ought to be true. Fig- 
ure 12 shows the graph of a one-one function f with a tangent line L through 
(a. f(a)). If this entre picture is reflected through the diagonal, it shows the graph 
of f~! and the tangent line L’ through (f(a), a). The slope of L’ is the reciprocal 
of the slope of L. In other words, it appears that 


53.8.0 well 
(f YON) = aay 


This formula can equally well be written in a way which expresses (f-')'®) di- 
rectly, for each b in the domam of ram 


oo 
ORS arr, 


Unlike the argument for continuity, this pictorial “proof” becomes somewhat 
involved when formulated analytically. “Vhere is another approach which might 


UG IRSE 3 


eye x3 
THEOREM 4 
PROOF 
fla) = Vx 
(b) 
THEOREM 5 
PROOF 
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be tried. Since we know that 


(ACiag@, ee 


it is tempting to prove the desired formula by applying the Chain Rule: 


! -1 =p a 
7 (Ff Bed) ) Gi) = 1, 

sO 

fF YQ) =>. 
Tia) 
Unfortunately, this is not a proof that f~! is differentiable, since the Chain Rule 
cannot be applied unless f~! is already known to be differentiable. But this argu- 
ment does show what (f~!)’(x) will have to be if f~! is differentiable, and it can 
also be used to obtain some important preliminary information. 


If f is a continuous one-one function defined on an interval and _f’(f~!(a)) = 0, 
then f~! is not differentiable at a. 


We have 
aes, 


If f~! were differentiable at a, the Chain Rule would imply that 


Fo ‘Gia Wi@ =), 


hence 


0-(f Y@=1, 


which is absurd. § 


A simple example to which Theorem 4 applies is the function f(x) = x?. Since 
f'(O) =0 and 0 = f~'(0), the function f~! is not differentiable at 0 (Figure 13). 

Having decided where an inverse function cannot be diflerentiable, we are now 
ready for the rigorous proof that in all other cases the derivative is given by the 
formula which we have already “derived” in two different ways. Notice that the 
following argument uses continuity of f~', which we have already proved. 


Let f be a continuous one-one function defined on an interval, and suppose that 
f is differentiable at f~'(b), with derivative f’(f~!(b)) 4 0. Then f7! is differ- 
entiable at b, and 


l 
oe ee 
y(b) = ————__.. 
Gi (b) Fo) 


Beeb f(a) Pliem 


mee a) f |) 
lim > 
h-O h 
; ¥ b+ —a 
= hm ———_- 
h>0 h 
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Now every number b +h in the domain of f~! can be written in the form 
b+h= f(a+k) 


for a unique k (we should really write k(h), but we will stick with k for simplicity). 


Then 


_ f \(b+h)—a 
ee 
h—0 h 
. FIGFaGTENV—s 
= hm ——————_ 
h-0 = f(a+k)—b 
k 


= im —————*—-: 
h>0 f(a+k)— f(a) 
We are clearly on the right track! It is not hard to get an explicit expression for k; 
since 
b+h= f(a+k) 
we have 
f-'(b+h) =a+k 

or 


k= f-'(b+h) — f(b). 


Now by Theorem 3, the function f~! is continuous at b. This means that k 
approaches 0 as h approaches 0. Since 


ey = : 
fim “A EOW = fila) = fF) £0. 


this implies that 


(f-')(b) = ll 


l 
PCE): 


The work we have done on inverse functions will be amply repaid later, but here 
is an immediate dividend. For n odd, let 


Stee for all x: 


for n even, let 


oS. x>0. 


Then f, is a continuous one-one function, whose inverse function is 


Bn(X) — Uy — xin. 
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By Theorem 5 we have, for x 4 0, 


F l 
25 (x) = Fifa) 
a l 
1G ey 
a | 
i oe 
A x !-d/n) 
= ib l/l 
n 


Thus, if f(+) = x“, and a is an integer or the reciprocal of a natural number, then 
f'(x) = ax*"!. It is now easy to check that this formula is true if a is any rational 
number: Let a = m/n, where m is an integer, and n is a natural number; if 


f(x) = xin/n = Ca ae 


then, by the Chain Rule, 


l 


Y m—l| e 
Fe) m(x!/") yg Glinyst 
n 
= Se : x Lon/m)—(C/m]+[/n)—1] 
il 
— mm (mjn)y—1 
n 


Although we now have a formula for f’(+) when f(x) = x% and a is rational, 
the treatment of the function f(x) = x® for irrational a will have to be saved 
for later—at the moment we do not even know the meaning of a symbol like x2, 
Actually, inverse functions will be involved crucially in the definition of x@ for 
irrational a. Indeed, in the next few chapters several important functions will be 
defined in terms of their inverse functions. 


PROBLEMS 


1. Find f~! for each of the following f. 


G) f@~)=x3 +1. 
(i) fX)=(@- 1)>. 
(ii) f(x) = | 


es x rational 
—x, Xx irrational. 


— x2 
(iv) f@)= | hs 
(Si ite) = erage Wea fa n—1| 


Gi Ok = 0x: 
(vi) ff) =x [x]. 
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10. 


11. 
"el: 
13. 


(vu) f(O.a;a2a3...) = 0.a2a)a3.... (Decimal representation is bemg used.) 


(vin) f(x) = — lie 
=n) 


Describe the graph of f~! when 


Gi)  (f is increasing and always positive. 

(1) f is increasing and always negative. 
(ii) f is decreasing and always positive. 
(iv) f is decreasing and always negative. 


Prove that if f is increasing, then so is f~', and similarly for decreasing 
functions. 


If f and g are increasing, is f + g? Or f-g? Or fog? 


(a) Prove that if f and g are one-one, then f o g is also one-one. Find 
(f og)~! in terms of f7! and g7!. Hint: The answer is not f7! 0 g7!. 


(b) Find g@! in terms of fe ei eo) 1 ae). 


x+l 
ear is one-one if and only if ad — be ¥ 0, and find 


Show that f(x) = 


+4. 


f—! in this case. 
On which intervals [a, b] will the following functions be one-one? 


Gi) Ge = x0? — 3x2. 
Gi) ef (Odie xr? sc an: 
Gil) f(x) = (1 +x2)7!. 


fw) fey. 


Suppose that f 1s differentiable with derivative f’(x) = (1 + x3)—!/2, Show 
fiat 2 — at satisfies g”(x) = 5a (x)?. 


Suppose that f is a one-one function and that f~! has a derivative which is 
nowhere 0. Prove that f is differentiable. Hint: There is a one-step proof. 


As a follow up to Problem 10-17, what additional condition on g will insure 
that f is differentiable? 


Find a formula for of | Gee 
Prove that if f’(f7'(~)) 4 Oand f (f7!(x)) exists, then (£7!) (x) exists. 
The Schwarzian derivative & f was defined in Problem 10-19. 


(a) Prove that if G f(x) exists for all x, then G f~!(x) also exists for all x in 
the domain of f-!. 
(b) Find a formula for Sf ~!(x). 


*14, 


(b) 
() 
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Prove that there is a differentiable function f such that (7op|P + f(x)+ 


x = 0 for all x. Hint: Show that f ean be expressed as an inverse 


function. The easiest way to do this is to find f~'. And the easiest way 
to do this is to set x = oy): 

Find f’ in terms of f, using an appropriate theorem of this chapter. 
Find f’ in another way, by simply differentiating the equation defining /. 


The funetion in Problem 14 is often said to be defined implicitly by the 
equation y>+y+x = 0. The situation for this equation is quite special, however. As 
the next problem shows, an equation does not usually define a function implicitly 
on the whole line, and in some regions more than one function may be defined 


implicitly. 


15. 


(a) 


(b) 


What are the two differentiable functions f which are defined imphcitly 
on (—1, 1) by the equation x+y” = 1, i.e., which satisfy x?+[f (x)]? = 1 
for all x in (—1, 1)? Notice that there are no solutions defined outside 
[=a A). 

Which functions f satisfy x* + [f(x)]? = —1? 


*(c) Which differentiable functions f satisfy [f(x)]* — 3 f(x) = x? Hint: It 


will help to first draw the graph of the function g(x) = x? — 3x. 


In general, determining on what intervals a differentiable function is defined im- 
plicitly by a particular equation may be a delicate affair, and is best diseussed in the 


context of advanced caleulus. If we assume that f is such a diflerentiable solution, 
however, then a formula for f’(x) can be derived, exactly as in Problem 14(c), by 
differentiating both sides of the equation defining f (a proeess known as “implicit 


differentiation’ ): 


16. 


Ly. 


18. 


19. 


(a) 


(b) 


Apply this method to the equation [f(x)]? +x? = 1. Notice that your 
answer will involve f(x); this is only to be expected, since there is more 
than one function defined implicitly by the equation y? + x? = 1. 

But check that your answer works for both of the functions f found in 
Problem 15(a). 

Apply this same method to [f(x)]* — 3 f(x) = x. 


Use implicit differentiation to find f’(x) and f”(x) for the functions f 
defined implicitly by the equation x3 + y? = 7. 

One of these functions f satisfies f(—1) = 2. Find f’(—1) and f”’(—1}) 
for this f. 


The eolleetion of all points (x, y) sueh that 3x3 + 4x7y — xy? + 2y3 = 4 
forms a eertain curve in the plane. Find the equation of the tangent line to 


this curve at the point (—1, 1). 


Leibnizian notation is particularly convenient for imphcit differentiation. Be- 
cause y is so consistently used as an abbreviation for f(x), the equation in x 
and y which defines f implicitly will automatically stand for the equation 
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20. 


20. 


2. 


23. 


24 


which f is supposed to satisfy. How would the followmg computation be 
written i our notation? 


y by xy ee 


pee 3 PEE Recs = 
: cial : aa ae : 
dy =) 


dx 4y3 4 3y24x- 


4 


As long as Leibnizian notation has entered the picture, the Leibnizian no- 
tation for derivatives of inverse functions should be mentioned. If dy/dx 
denotes the derivative of f, then the derivative of f~! is denoted by dx/dy. 
Write out Theorem 5 in this notation. The resulting equation will show you 
another reason why Leibnizian notation has such a large following. It will 
also explain at which point (f~!)’ is to be calculated when using the dx /dy 
notation. What is the significance of the following computation? 


n 


na 
y= Va 
ax!” = (all 
dx dx dx oe le 


dy 


Suppose that f is a differentiable one-one function with a nowhere zero 
derivative anc that f — FF’. Let G(x) = gia 46°) = ECG). Prove that 
G'(x) = f-!(x). (Disregarding details, this problem tells us a very interesting 
fact: if we know a function whose derivative is f, then we also know one 
whose derivative is f~!. But how could anyone ever guess the function G? 
‘Two different ways are outlined in Problems 14-14 and 19-16.) 


Suppose /i is a function such that h’(x) = sin? (sin (x wl) and #¢(O) 3. 
Find 


1) (fh DiGi 
ii) (B-')'(3), where B(x) = h(x + 1). 


( 

( 

(a) Prove that an increasing and a decreasing function intersect at most once. 

(b) Find two continuous increasing functions f and g such that f(«) = g(x) 
precisely when x is an integer. 

(c) Find a continuous increasing function f and a continuous decreasing 
function g, defined on R, which do not intersect at all. 


(a) If f is a continuous function on R and f = f~!, prove that there is at 
least one x such that f(x) = x. (What does the condition f = f~! mean 
gcometrically?) 

(b) Give several examples of continuous f such that f = yl and f@)=% 
for exactly one x. Hint: ‘Try decreasing f, and remember the geometric 
interpretation. One possibility is f(v) = —x. 


#2. 


26. 


a B 


(c) 
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Prove that if f is an increasing function such that f = f7!, then 


f(x) = x for all x. Hint: Although the geometric interpretation will 
be immediately convincing, the simplest proof (about 2 lines) is to rule 
out the possibilites f(x) < x and f(x) > x. 


Which functions have the property that the graph is still the graph of a func- 
tion when reflected through the graph of —/ (the “antidiagonal”)? 


A function f is nondecreasing if f(x) < f(y) whenever x < y. (To be 


more precise we should stipulate that the domain of f be an interval.) A 
nonincreasing function is defined similarly. Caution: Some writers use 


“increasing” instead of “nondecreasing,” and “strictly creasing” for our 


“increasing, » 


(a) 


Prove that if f is nondecreasing, but not increasing, then f is constant 
on some interval. (Beware of unintentional puns: “not increasing” is not 
the same as “nonincreasing,’”’) 

Prove that if f is differentiable and nondecreasing, then f’(x) > O for 
all x. 

Prove that if f’(x) => 0 for all x, then f is nondecreasing. 


Suppose that f(x) > 0 for all x, and that f is decreasing, Prove that 
there is a continuous decreasing function g such that 0 < g(x) < f(x) for 
all x. 

Show that we can even arrange that g will satisfy dim gees (x = 0. 
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APPENDIX. PARAMETRIC REPRESENTATION OF CURVES 


The material in this chapter serves to emphasize something that we noticed a 
long ume ago—a perfectly nice looking curve need not be the graph of a function 
(Figure 1). In other words, we may not be able to describe it as the set of all points 
(x, f(x)). Of course, we might be able to describe the curve as the set of all points 
(f(x),x); for example, the curve in Figure | is the set of all pots (x?,x). But 
even this trick doesn’t work in most cases. It won’t allow us to describe the circle, 
consisting of all points (x, y) with x? + y? = 1, or an ellipse, and it can’t be used 
to describe a curve like the one in Figure 2. 


The simplest way of describing curves im the plane m general harks back to the 
FIGURE 1 physical conception of a curve as the path of a particle moving in the plane. At 
each time f, the particle is at a certam point, which has two coordinates; to indicate 
the dependence of these coordmates on the time f, we can cal] them u(t) and v(t). 
Thus, we end up with @o functions. Conversely, given two functions u and v, we 
can consider the curve consisting of all points (w(t), v(t)). This curve is said to 
be represented parametnrcally by u and v, and the pair of functions u, v is called a 
parametric representation of the curve. The curve represented parametrically by 
u and v thus consists of all pairs (x, y) with x = u(t) and y = v(t). It is often 
described briefly as “the curve x = u(t), y = v(t).” Notice that the graph of a 
function f can always be described parametrically, as the curve x =f, y = f(t). 
Instead of considering a curve in the plane as defined by two functions, we 
can obtain a conceptually simpler picture if we broaden our original definition of 
function somewhat. Instead of considering a rule which associates a number with 
another number, we can consider a “function ¢ from real numbers to the plane,” 
FIGURE 2 1e., a rule c that associates, to each number ft, a pot in the plane, which we can 
denote by c(t). With this notion, a curve is just a function from some interval of 
real numbers to the plane. 
Of course, these two different descriptions of a curve are essentially the same: 
A pair of (ordinary) functions u and v determines a single function ¢ from the real 
numbers to the plane by the rule 


eG) =u), va 


and, conversely, given a function ¢ from the real numbers to the plane, each c(t) 
is a point m the plane, so it ts a pair of numbers, which we can call u(t) and v(t), 
so that we have unique functions u and v satisfying this equation. 
In Appendix 1 to Chapter 4, we used the term “vector” to describe a pomt in 
the plane. In conformity with this usage, a curve in the plane may also be called 
e(t) a “vector-valued function.” The conventions of that Appendix would lead us to 
write c(t) = (ey (t), c2(7)), but m this Appendix we'll continue to use notation like 
—- —_— c(t) = (u(t), v(t)) to mmimnize the use of subscripts. 


A simple example of a vector-valued function that is quite useful ts 


e(t) = (cost, sin), 


FIGURE 3 which goes round and round the unit circle (igure 3). 


FIGURE 4 
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For two (ordinary) functions f and g, we defined new functions f+ g and f-g 
by the rules 


(1) (fe og) = 7) = oly). 
(2) Giereix) = f(y 2). 


Since we have defined a way of adding vectors, we can imitate the first of these def- 
imitions for vector-valued functions c and d: we define the vector-valued function 
c+d by 


(errr = Gil) ad (ms 


where the + on the right-hand side is now the sum of vectors. ‘This simply amounts 
to saying that if 


c(t) = (u(t), v(t), 
d@y Swit), zw), 


then 
(c +d)(t) = (w(t), v(t) + (wd), 21) = (ue) + w(t), v(t) + 2). 


Recall that we have also defined a -v for a number a and a vector v. To 
extend this to vector-valued functions, we want to consider an ordinary fuaction a 
and a vector-valued function c, so that for each t we have a number a(t) and a 
vector c(t). Then we can define a new vector-valued function @ - ¢ by 


(a -c)(t) = a(t) - c(t), 


where the - on the right-hand side is the product of a number and a vector. This 
sunply amounts to saying that 


(a -c)(t) =a(t) - (a(t), v(t) = (a(r) -u(t), a(t) - v(t)). 
Notice that the curve @ - e, 


(a-e)(t) = (a(t) cost, a(t)sint), 


is already quite general (Figure 4). In the notation of Appendix 3 to Chapter 4, 
the point (@-e)(t) has polar coordinates @(t) and f, so that (@-e)(t) is the “graph 
of @ in polar coordinates.” 

Even more generally, given any vector-valued function c, we can define new 
functions r and @ by 


c(t) = r(t)- e(@(t)), 


where r(t) 1s just the distance from the origin to c(t), and @(t) is some choice of 
the angle of c(r) (as usual, the function @ isn’t defined unambiguously, so one has 
to be careful when using this way of writing an arbitrary curve c). 

We aren't in a position to extend (2) to vector-valued functions m general, since 
we haven't defined the product of two vectors. However, Problems 2 and 4 of 
Appendix | to Chapter 4 define two real-valued products v + w and det(v. w). It 
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FIG RE 5 


c(a +h) 


c(a) 


should be clear, given vector-valued functions c and d, how we would define two 
ordinary (real-valued) functions 


cd and det(c, d). 
Beyond imitating simple arithmetic operations on functions, we can consider 
more interesting problems, like limits. For c(t) = (u(t), v(t)), we can define 


(x) lim c(t) = lima (2 (t), v(t)) tobe (tim u(t), lim v(0)) ; 
td mitt t 


pg a8} 


Rules like 


live -+- = leinc— lind, 
hcl ta [>a 


lina +c = lma(t)- lime 
(Xe! 


a fq 


follow immediately. Problem 10 shows how to give an equivalent definition that 
umitates the basic definition of limits directly. 
Limits lead us of course to derivatives. For 


elt) = Gr), v7) 
we can define c’ by the straightforward definition 
c'(a) = (u'(a), v'(a)). 
We could also try to imitate the basic definition: 
F . e(at+h) —c(a) 
c (a) = hm —————__, 
h-0 h 


where the fraction on the right-hand side is understood to mean 


] 
=o eles) Soe. 
h 
As a matter of fact, these two definitions are equivalent, because 


. ce(at+th) —c(a) . ulath)—ula) vlat+th)— v(a) 
i  .  —————————oee 


h-0 h h-0 hh h 


. ulath)—u(a) .. vlath)— v(a) 
= {| hm —-_,, ims, 


h—0 h h0 h 


by our definition () of limits 
= (u'(a), v'(a)). 


Figure 5 shows c(a +) and c(a), as well as the arrow from c(a) to c(a +h); 
as we showed in Appendix | to Chapter 4, this arrow is c(a +h) — c(a), except 
moved over so that it starts at c(a). As h — 0, this arrow would appear to move 
closer and closer to the tangent of our curve, so it seems reasonable to define the 
tangent line of ¢ at c(a) to be the straight line along c’(a), when c’(a) 1s moved 
over so that it starts at c(a). In other words, we define the tangent line of ¢ at c(a) 
as the set of all points 

clays -c (2); 
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for s = 0 we get the point c(a) itself, for s = 1 we get c(a) + c’(a), etc. (Note, 
however, that this definition does not make sense when c’(a) = (0, 0).) Problem | 
shows that this definition agrees with the old one when our curve c is defined by 


c(t) = (f, f)), 
so that we simply have the graph of /f. 
Once again, various old formulas have analogues. For example, 
(c +d) (a) =c'(a)+d'(a), 
(a-c) (a) =a"(a) -c(a) +a(a)-c'(a), 
or, as equations involving functions, 
(BPO) Se aa. 
(a-c) =a'-ct+a-c. 
These formulas can be derived immediately from the definition in terms of the 
component functions. They can also be derived from the definition as a limit, 
by imitating previous proofs; for the second, we would of course use the standard 
trick of writing 
a(a+h)c(a+h)—ala)c(a) = 
a(a+h)-[e(a +h) —c(a)]) + [e@t+h) —a(a)]-c(a). 
We can also consider the function 
d(t) =c(p(t)) = (co p)(t), 


where p is now an ordinary function, from numbers to numbers. ‘The new curve d 
passes through the same points as c, except at different times; thus p corresponds 
to a “reparameterization” of c. For 


c=(u, v), 
ad = Kec p, vo p'), 


we obtain 


d'(a) = ((wo p)'(a), (vo p)'(a)) 
= (p'(a)u'(p(a)). p'(ayu'(p(a))) 
=§p (a) (u'(p(a)). v'(p(a))) 
= p'(a)-c'(pta)), 


or simply 
oe ACY 


Notice that if p(a) = a, so that d and c actually pass through the same point at 
time a, then d’(a) = p’(a)-c’(a). so that the tangent vector d’(a) is just a multiple 
of c'(a). This means that the tangent /ine to c at c(a) is the same as the tangent 
line to the reparameterized curve d at d(a) = c(a). The one exception occurs 
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when p’(a) = Q, since the tangent line for d is then undefined, even though the 
tangent line for c may be defined. For example, d(1) = c(t?) won't have a tangent 
line defined at ¢ = 0, even though it’s merely a reparameterization of c. 

Finally, since we can define real-valued functions 


(cag) (a) = c(t) - dw) 
det(craiG@) = det(c(s), aa), 


we ought to have formulas for the derivatives of these new functions. As you might 
guess, the proper formulas are 


(c-d)'(a) =c(a)-d'(a) + c'(a)+-d(a), 
[det(c, d)|/(a) = det(c’, d)(a) + det(c, d’)(a). 


These can be derived by straightforward calculations from the definitions in terms 
of the component functions. But it is more elegant to imitate the proof of the or- 
dinary product rule, using the simple formulas in Problems 2 and 4 of Appendix 1 
to Chapter 4, and, of course, the “standard trick” referred to above. 


PROBLEMS 


I. (a) Fora function f, the “poimt-slope form” (Problem 4-6) of the tangent 
line at (a, f(a)) can be written as y — f(a) = (x — a) f’(a), so that the 
tangent line consists of all points of the form 


(x, f(a) + (x — a) f'(a)). 


Conclude that the tangent line consists of all poimts of the form 
(a+s, f(a) +sf'(a)). 


(b) If c is the curve c(t) = (t, f(t)), conclude that the tangent line of c at 
(a, f(a)) [using our new definition] is the same as the tangent line of f 


ate(a, {(@); 


2, Leta) = (fier 17), where f is the function shown in Figure 21 of Chap- 
ter 9. Show that ¢ hes along the graph of the non-differentiable function 
h(x) = |x|, but that c’(0) = (0, 0). In other words, a reparameterization can 
“hide” a corner. For this reason, we are usually only interested in curves ¢ 
with c’ never equal to (0, 0). 


3. Suppose that x = w(t), y = v(t) is a parametric representation of a curve, 
and that w is one-one on some interval. 


(a) Show that on this interval the curve lies along the graph of f = vou7!, 
(b) If wis differentiable on this interval and u’(t) 4 0, show that at the point 
x = #7) we have 
. v(t) 
f= 


u'(t) 


FIGURE 6 


P 


(c) 
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In Leibnizian notation this is often written suggestively as 


dy 
dy = dt 
dx dx 

dt 


We also have 
u'(t)v"(t) — v’(t)u’(t) 


in (u'(1))3 


Consider a function f defined implicitly by the equation x7 + y?/? = 1. 
Compute f’(x) in two ways: 


(i) 
(ii) 


By implicit differentiation. 


: c : : 53) 
By considering the parametric representation x = cos*t, y = sin” t. 


Let x = u(t), y = v(t) be the parametric representation of a curve, with 


u and v differentiable, and let P = (xo, yo) be a point in the plane. Prove 
that if the pomt Q = (u(), v(7)) on the curve is closest to (xo, yo), and u’(f) 
and v’(t) are not both 0, then the line from P to Q is perpendicular to the 
tangent line of the curve at Q (Figure 6). The same result holds if Q is 


furthest from (xo, yo). 


We've seen that the “graph of f in polar coordinates” is the curve 


(f-e)() = GF) cost, f@) sm); 


in other words, the graph of f in polar coordinates is the curve with the para- 


metric representation 


6. 


(a) 


(b) 


fa— 7G) cos: y= f(@) suae: 


Show that for the graph of f in polar coordinates the slope of the tangent 
line at the point with polar coordinates (f (6), 8) 1s 


—f(@)sin@ + (0) cosé- 


Show that if f(@) = 0 and f is differentiable at 6, then the line through 
the origin making an angle of 6 with the positive horizontal axis is a 
tangent line of the graph of f in polar coordinates. Use this result to 
add some details to the graph of the Archimedean spiral in Appendix 3 
of Chapter 4, and to the graphs in Problems 3 and 10 of that Appendix 
as well. 

Suppose that the point with polar coordinates (f(@). 6) is further from 
the origin O than any other point on the graph of f. What can you 
say about the tangent line to the graph at this point? Compare with 
Problem 5. 
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FIGURE 7 


(d) Suppose that the tangent line to the graph of f at the point with po- 


(b) 


lar coordinates (f(@),@) makes an angle of @ with the horizontal axis 
(Figure 7), so that a — @ is the angle between the tangent line and the 
ray from O to the point. Show that 


f() 

f'(O) 

In Problem 8 of Appendix 3 to Chapter 4 we found that the cardioid 
r = | —sin@ 1s also described by the equation Cue) = ae 


tan(a —8@) = 


Find the slope of the tangent line at a point on the cardioid in two ways: 
(i) By implicit differentiation. 
(ii) By using the previous problem. 
Check that at the origin the tangent lines are vertical, as they appear to 
be in Figure 8. 


Fala ERS 


The next problem uses the material from Chapter 15, m particular, radian 


measure, and the inverse trigonometric functions and their properties. 


8. A cycloid is defined as the path traced out by a point on the rim of a rolling 
wheel of radius a. You can see a beautiful cycloid by pasting a reflector on 


the edge of a bicycle wheel and having a friend ride slowly in front of the 


headlights of your car at night. Lacking a ear, bicycle, or trusting friend, you 


can settle instead for Figure 9. 


FGA 9 


FIGURE 10 


length at 
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(a) Let w(t) and v(t) be the coordinates of the poimt on the rim after the 
wheel has rotated through an angle of ¢ (radians). This means that the 
arc of the wheel rim from P to Q in Figure 10 has length at. Since the 
wheel is rolling, at is also the distance from O to Q. Show that we have 
the paramctric represcntation of the cycloid 


u(t) = a(t —sint) 
v(t) = a(1 —costr). 


Figure 11 shows the curves we obtain if the distance from the point to the 
center of the wheel is (a) less than the radius or (b) greater than the radius. 
In the latter case, the curve is not the graph of a function; at certain times 
the point is moving backwards, even though the wheel is moving forwards! 


(b) 


FIGURE 11 


In Figure 9 we drew the cycloid as the graph of a function, but we really 
need to check that this is the case: 


(b) Compute u/(t) and conclude that wu is increasing. Problem 3 then shows 
that the cycloid is the graph of f = vou7!, and allows us to compute 


fay 


(c) Show that the tangent lines of the cycloid at the “vertices” are vertical. 
It isn’t possible to get an explicit formula for f, but we can come close. 


(d) Show that 


a: (t 
u(t) =a arccos seen) 3. 2 olay: 


a 


Hint: first solve for ¢ in terms of v(t). 
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FIGURE 12 


10. 


(ec) The first half of the first arch of the cycloid is the graph of g7! 


a—y¥ 
g(y) = aarccos —= /@a— yy. 


a 


, where 


Let u and v be continuous on [a,b] and differentiable on (a,b); then u 
and v give a parametric representation of a curve from P = (u(a), v(a)) 
to Q = (u(b), v(b)). Geometrically, it seems clear (Figure 12) that at some 
point on the curve the tangent line is parallel to the lme segment from P 
to Q. Prove this analytically. Hint: This problem will give a geometric 
interpretation for one of the theorems in Chapter 11. You will also need to 
assume that we don’t have u’(x) = v'(x) = O for any x in (a, b) (compare 
Problem 2). 


The following definition of a limit for a vector-valued function is the direct 
analogue of the definition for ordinary functions: 


lim c(t) = 7 means that for every ¢ > O there is some 6 > O such that, for 
[>a 


all r, if O < |r —a| < 6, then ||c(t) —1|| < e. 


Here || || is the norm, defined in Problem 2 of Appendix | to Chapter 4. If 
i = (1), 6) when 


let) — U7 = Ju) — 4? + |v) — bP. 


(a) Conclude that 
u(t) Fy [-S flett) — fll and wl) — | sie) — 11], 


and show that if lim c(t) =/ according to the above definition, then we 
, ; 


rat 2 


also have 


—>a t>a 


lim w(t) = and lim v(t) = bh, 


so that lim c(t) =/ according to our definition (*) in terms of component 
ae ate! : 


functions, on page 246. 
(b) Conversely, show that if lim c(t) = / according to the definition in terms 
é toa WO 


of component functions, then also lim c(t) = / according to the above 
: 


=a 


definition. 


(a, 0) 


FIGURE 1 


FIGURE 2 


FEGRS) 3 


CHAPTER 


INTEGRALS 


The derivative does not display its full strength until allied with the “mtegral,” the 
second main concept of Part III. At first this topic may seem to be a complete 
digression— in this chapter derivatives do not appear even once! The study of 
integrals does require a long preparation, but once this prelimimary work has been 
completed, integrals will be an invaluable tool for creating new functions, and the 
derivative will reappear i Chapter 14, more powerful than ever. 

Although ultimately to be defined in a quite complicated way, the integral for- 
malizes a simple, intuitive concept that of area. By now it should come as 
no surprise to learn that the definition of an intuitive concept can present great 
difficulties —‘“‘area” is certainly no exception. 

In elementary geometry, formulas are derived for the areas of many plane fig- 
ures, but a little reflection shows that an acceptable definition of area is seldom 
given. ‘The area of a region 1s sometimes defined as the number of squares, with 
sides of length 1, which fit in the region. But this definition is hopelessly inadequate 
for any but the simplest regions. For example, a circle of radius 1 supposedly has 
as area the irrational number z, but it is not at all clear what “z squares” means. 
Even if we consider a circle of radius 1//z, which supposedly has area 1, it is hard 
to say in what way a unit square fits m this circle, smce it does not seem possible 
to divide the unit square into pieces which can be arranged to form a circle. 

In this chapter we will only try to define the area of some very special regions 
(Figure 1)—-those which are bounded by the horizontal axis, the vertical lines 
through (a,0) and (b,0), and the graph of a function f such that f(x) > 0 
for all x in [a, b]. It is convenient to indicate this region by R(f. a. b). Notice that 
these regions include rectangles and triangles, as well as many other important 
geometric figures. 

The number which we will eventually assign as the area of R(f, a,b) will be 
called the itegral of f on [a,b]. Actually, the integral will be defined even for 
functions f which do not satisfy the condition f(x) > O for all x in [a,b]. If f is 
the function graphed in Figure 2, the integral will represent the difference of the 
area of the hghtly shaded region and the area of the heavily shaded region (the 
“aloebraic area” of RC f.a, b)). 

The idea behind the prospective definition is indicated in Figure 3. ‘Vhe interval 
[a, b] has been divided into four subintervals 


[oon] [hoe] [oe] [6,4] 
by means of numbers fo, f), f2, 8, 4 with 
Gai) — i <p —eiape i —ap 


(the numbering of the subscripts begins with O so that the largest subscript will 
equal the number of subintervals). 


ps) 
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DEFINITION 


DEFINITION 


On the first mterval [fo, t)| the function f has the mmimum value my, and the 
maximum value M,; similarly, on the 7th interval [4—1, ¢;] let the minimum value 
of f be m; and let the maximum value be M;. The sum 


Ss = my(t) — (9) + m2(t2 — 11) +: 113(& — t2) ting(t4 — 3) 


represents the total area of rectangles lying inside the region R(f, a,b), while the 
sum 


S = Mi Gp—to) + Mo — 4) + M36 —b) + M4 — Bs) 


represents the total area of rectangles contaiming the region R(f, a,b). The guid- 
ing principle of our attempt to define the area A of R(f, a,b) is the observation 
that A should satisfy 

c-Acand A= S, 


and that this should be true, 70 matter how the interval [a, b| is subdivided. It is to be 
hoped that these requirements will determine A. The following definitions begin 
to formalize, and elimimate some of the implicit assumptions 1n, this discussion. 


Let a < b. A partition of the interval [a, b| is a finite collection of poimts in 


[a,b], one of which is a, and one of which is b. 


The poimts in a partition can be numbered ft, ..., fr so that 
GC —Hheced p< - °° <elge le i SO. 


we shall always assume that such a numbering has been assigned. 


Suppose f is bounded on [a,b] and P = {f,.... t,} is a partition of [a,b]. Let 
ie — ii | x ) | oem et, 
iT ey) elements 

The lower sum of f for P, denoted by L(f. P), is defined as 


n 


ges Yo milt Saat 2s ise 


i! 


The upper sum of f for P, denoted by U(f, P), is defined as 


U(f, P)= So Milt sabia) 


I 


Vhe lower and upper sums correspond to the sums s and S in the previous 
example; they are supposed to represent the total areas of rectangles lying below 
and above the graph of f. Notice, however, that despite the geometric motivation, 
these sums have been defined precisely without any appeal to a concept of “area.” 


FIGURE 4 


PUG RES 


LEMMA 


PROOF 
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Two details of the definition deserve comment. The requirement that f be 
bounded on [a,b] is essential in order that all the m; and M; be defined. Note, 
also, that it was necessary to define the numbers m; and M; as inf’s and sup’s, 
rather than as minima and maxima, since f was not assumed continuous. 

One thing is clear about lower and upper sums: If P is any partition, then 


LFaclnbe AUC), fe), 


because 


n 


L(f, P= Yomilt =i 


i=l 


Uf. P) =) Mili — 4-1). 


i 


and for each 7 we have 
ie OV eet) 


On the other hand, something less obvious ought to be true: If P; and P> are 
any two partitions of [a,b], then it should be the case that 


Lh Ce amaseU( f, P3), 


because L(f, P)) should beg=sanearn@ 71). ands U(f, P>) shouldibe:—sarea 

R(f, a,b). This remark proves nothing (since the “area of R(f, a, b)” has not even 
been defined yet), but it does indicate that if there is to be any hope of defining the 
area of R(f. a,b), a proof that L(f, P}) < UCf, P2) should come first. The proof 
which we are about to give depends upon a lemma which concerns the behavior of 
lower and upper sums when more points are included in a partition. In Figure 4 
the partition P contains the points in black, and Q contains both the points in 
black and the points in grey. ‘The picture indicates that the rectangles drawn for 
the partition Q are a better approximation to the region R(f. a. b) than those for 
the original partition P. To be precise: 


If Q contains P (1e., if all points of P are also in Q), then 


Dae) = Lf 0). 
U(f. P) = UC, Q). 


Consider first the special case (Figure 5) in which Q contains just one more point 
tice: 


where 


a9 =a = ee Kh, = 
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THEOREM 1 


Let 


Hie = wil { ff X) > heel ee ee, 
=n pW) > es xe ee 


~ 

~ 

~ 
>= 


Then 


i 


Lif, P= Saami (6 ==), 


| 
k-] Hn 

EGO) = So mili —f-)) tm’ (u — th 1) tim" (y — uw) t+ y mj (t) — t—1). 
iil i=k+1 


To prove that L(f, P) < L(f, Q) it therefore suffices to show that 
my (th — te_1) < im’ (u — th_)) ta" (me — KB). 


Now the.set (fQ)Em 4 = x < t) Contains allMheminmbersanet? (x) sath) = 
x < uj}, and possibly some smaller ones, so the greatest lower bound of the first set 
is less than or equal to the greatest lower bound of the second; thus 


my <a. 


Similarly, 


Pi je cS We 
Therefore, 
ni (tk — 1) = Re — G1) Hint oe) < a (a — G4) +m’ G — @). 


This proves, in this special case, that L(f, P) < L(f. Q). The proof that U(f, P) = 
U(f, Q) is similar, and is left to you as an easy, but valuable, exercise. 

The general case can now be deduced quite easily. ‘The partition Q can be 
obtained from P by adding one point at a time; in other words, there is a sequence 
of partitions 


oa POPs . ee P— Oo 
such that Pj;) contains just one more point than P;. ‘Then 
Lj, P) = EGR Lf.) = ++ = Lf, Ba) LOO): 


and 
Uf, P)= UgarikesU (f. Ps) = --- =U GPa eae an 


The theorem we wish to prove is a simple consequence of this lemma. 


Let P} and P3 be partitions of [a,b], and let f be a function which is bounded 
on [a,b]. khan 
L ( fgets): 


PROOF 


a=oh b 


Felner REG) 
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There is a partition P which contains both P; and P3 (let P consist of all points 
in both Py and Pz). According to the lemma, 


LG ep sveme) = Uf, P) 2 Uf, Paya 


It follows from Theorem | that any upper sum U(/. P’) is an upper bound for 
the set of all lower sums L(f, P). Consequently, any upper sum U (f, P’) 1s greater 
than or equal to the /east upper bound of all lower sums: 


sup{L(f, P) : P a partition of [a, b]} < UCf, P’), 


for every P’. This, in turn, means that sup{L(f, P)} is a lower bound for the set 
of all upper sums of f. Consequently, 


SUD fe) fe tr yr) 


It is clear that both of these numbers are between the lower sum and upper sum 
of f for all partitions: 


LG, P’) <isupl LG, Py) age’ 
LG P)=mt{(Og, By UB), 


for all partitions P’. 
It may well happen that 


Sip) {2G femee j= mleh Cf. P}; 


in this case, this 1s the only number between the lower sum and upper sum of f 
for all partitions, and this number is consequently an ideal candidate for the area 


of R(f.a.b). On the other hand, if 
sup{L(f, P)) < inf{UCS, PD}. 
then every number x between sup{L(f, P)} and inf {U(f, P)} will satisfy 
LG@Ap =.= UEP) 


for all partttons P’. 

It is not at all clear just when such an embarrassment of riches will occur. ‘The 
following two examples, although not as interesting as many which will soon ap- 
pear, show that both phenomena are possible. 

Suppose first_that f(x) = ¢ fomall x in |@, b] (Figure 6). If P = {%, ..., Bit 1S 
any partition of [a,b], then 

na M; = c., 


sO 
Le B) = Ye c(t, a1) =cb— a), 
i=] 
U(f, P) = ae =f) = 4h — a). 


i=] 


258 Derwwatives and Integrals 


In this case, all lower sums and upper sums are equal, and 
sup(L(f, P)} = mit Fe) == a). 
Now consider (Figure 7) the function f defined by 


: Q, x irrational 
yx) = 


le Vx ratronal: 


lif P = fo." .. saison partition, then 


m; = 0, since there is an irrational number in [#-1. &], 


and 


Secccccccsocoone 


i= M; = 1, since there 1s a rational number in [f;-1, t]. 


BERG REZ. ‘Therefore, 


L(f, P) = ) 0° ~f-1) =0. 


iI 


Ut ei.) wal -\(f ee 


i=l 


Thus, in this case it is certainly nof true that sup{L(f, P)} = mf {UCf, P)}. The 
principle upon which the definition of area was to be based provides insufficient 
information to determine a specific area for R(f,a,b)--any number between 0 
and b — a seems equally good. On the other hand, the region R(f, a,b) is so 
weird that we might with justice refuse to assign it any area at all. In fact, we can 
maintain, more generally, that whenever 


supe /, FP) A miu Ge) 


the region R(f, a, b) is too unreasonable to deserve having an area. As our ap- 
peal to the word “unreasonable” suggests, we are about to cloak our ignorance m 
terminology. 


DEFINITION A function f which is bounded on [a, b] is integrable on [a,b] if 
sup{L(/,,.P) : P ajparatonyot [a,b]}' = mf{0( f,P) 2 P a partinon of [a, b]}. 


In this case, this common number 1s called the integral of f on [a,b] and is 


denoted by 
b 


I. 


a 


(The symbol f is called an integral sign and was originally an elongated s, for 
“sum;” the numbers a and b are called the lower and upper limits of integration.) 
‘The mtegral Vv f is also called the area of R(f, a,b) when f(x) > 0 for all x 
i [a, pI. 


THEOREM 2 


PROOF 


13. Integrals 259 


If f is mtegrable, then according to this definition, 


b 
LG wees f <U(f, P) for all partitions P of [a, bd]. 


a 


Moreover, ie f is the unque number with this property. 

‘This definition merely pinpomts, and does not solve, the problem discussed 
before: we do not know which functions are integrable (nor do we know how to 
find the integral of f on [a,b] when f is integrable). At present we know only 
two examples: 


b 
Cl) eufiey G@) =e) them / is inteerable’on ia, | and jf =Ct-(b— 2). 
a 
(Notice that this integral assigns the expected area to a rectangle.) 


Q, x irrational 
1, x rational, 


C2) ley (x) then f is not integrable on [a, b]. 


Several more examples will be given before discussing these problems further. 
Even for these examples, however, it helps to have the following simple criterion 
for integrability stated explicitly. 


If f is bounded on [a,b], then f is mtegrable on [a,b] if and only if for every 
€ > 0 there 1s a partition P of [a,b] such that 


UCP) — LP ya 


Suppose first that for every ¢ > 0 there is a partition P with 
O Ga a jer) ae 
Since 


uot (UC (Sey) Career, 
SUDA dol mule ee ay aes 


it follows that 
inhi 2) — supiees, 2) <a 


Since this is true for all ¢ > Q, it follows that 
supIIGwe Nini Chap): 


by definition, then, f is integrable. ‘The proof of the converse assertion is similar: 
If f is mtegrable, then 


supe (PP) — mit Unie Rot 
This means that for each ¢ > 0 there are partitions P’, P” with 


WCEP") SGP) <2 
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FIGURE 8 


Let P be a partition which contains both P’ and P”. Then, according to the 
lemma, 


OR W OE SGe )). 
BY 1e)) Jiweeiesp 


consequently, 


CCP) LF. PyYZUG P)—LG, FP) <e2.f 


Although the mechanics of the proof take up a httle space, it should be clear 
that Theorem 2 amounts to nothing more than a restatement of the definition 
of integrability. Nevertheless, it is a very convenient restatement because there is 
no mention of sup’s and inf’s, which are often difficult to work with. The next 
example illustrates this pomt, and also serves as a good mtroduction to the type 
of reasoning which the complicated definition of the integral necessitates, even m 
very simple situations. 


Let f be defined on [0, 2] by 


0," Zal 
Loo ‘ = 
he oxe=se 
Suppose P = {%o,....%,} 1s a partition of [0,2] with 


hile & Sahl 


(see Figure 8). Then 
mm =M; = OMe ea, 


but 
omy al) and M; =1. 
Since 
Hes s 
LES So mili St) 1 — Gee a >) m;(t; — tj—1). 
ial Sy 


j-1 " 
Uf. P) = 0 Milt — 4-1) + MG —G-D + D5 MiG — §-1). 
i= i=jt+l 


we have 


ON, Pye LF PH = Gan. 
This certainly shows that f is mtegrable: to obtain a partition P with 
US, 2 ine LU, 2) = €, 
it is only necessary to choose a parution with 
Go =< be; Baice 1, ee Tee 
Moreover, it 1s clear that 


L(f, P) <0 <U(f, P) for all partitions P. 
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Since f is integrable, there is only one number between all lower and upper sums, 
namely, the integral of f, so 
A 
{ ro 
0 


Although the discontinuity of f was responsible for the difficulties in this exam- 
ple, even worse problems arise for very simple continuous functions. For example, 
let f(x) = x, and for simplicity consider an interval [0,6], where b > 0. If 
P = {to,...,t,} is a partition of [0, b], then (Figure 9) 

m; = tj-1 and M; =1t,; 


and therefore 


L(f, P)= yt =4=})) 
i=l 


= Io(t) —1o) Hite — 4) + +414 — th-1), 


UF, BEy= at = =i) 


j=] 


= h(t) —to) +h(t2 —t1) +°°* + tht — th-1). 


FIGURE 9 Neither of these formulas is particularly appealing, but both simplify considerably 
for partitions P, = {to,...,t} into n equal subintervals. In this case, the length 
t; — t;-, of each subinterval is b/n, so 


fo = 0, 
b 
y=, 
n 
2b 
I = —,. Cc, 
n 
in general 
ib 
j= — 
n 


Then 


LG, P= > 4-10 =i) 


i=l 
53 (i—1)b| b 
- j=l n n 
= [ou | 
= n~ 
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FIGURE 10 


Remembering the formula 


this can be written 


Similarly, 


U(f, Pn) =) tilt — 4-1) 


Ls ib b 
c= 5 n 
ent} b? 
aS 
eed b? 
= ig oe 


If n is very large, both L(f, P,) and U(f, P,,) are close to b /2, and this remark 
makes it easy to show that f is integrable. Notice first that 

: 2 & 

U(f, Pn) — L(f, Ph) = -—- =: 
hn 2 

This shows that there are partitions P, with U(f, P,)—L(f, P,) as small as desired. 
By Theorem 2 the function f is integrable. Moreover, ie f may now be found 
with only a little work. It is clear, first of all, that 


b2 
Lif. Px) < SUC. Pa) forall n, 


~ aay 9 : ; ; 
Chis inequality shows only that b-/2 lies between certain special upper and lower 
sums, but we have just seen that U(f, P,) — L(f, P,) can be made as small as 
desired, so there is only one number with this property. Since the integral certainly 
has this property, we can conclude that 
b b2 
or 2 
. . . . 2) . . . . 
Notice that this equation assigns area b~/2 to a right triangle with base and alti- 
| sg gs 
tude b (Figure 10). Using more involved calculations, or appealing to Theorem 4, 
it can be shown that 
; 
6. pb a+ 


a - 


FIGURE 11 
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The function f(x) = x? presents even greater difficulties. In this case (Fig- 
ure Wah Pies. t,} is a partition of [0, b], then 


he = Th = (foa)r and M;= f(j)= i. 


Choosing, once again, a partition P, = {fo,..., t,} into n equal parts, so that 


the lower and upper sums become 


LF. Pa)'= > G1)" G4) 


i 


a ab” D 

=ye-5- 
= n- on 
b> 


U(f, Es) = fj 


Recalling the formula 
1? +--+? = Tk(k + (2k +1) 


from Problem 2-1, these sums can be written as 


L(f. Py) Be SOHO HA) 
 P,) =—» =(n — 1)(n)(2n —1), 
a3 e 1 1)(2n 
/ b> | 
n> 6 


It is not too hard to show that 


: b 
L(f; EF; ) s rw 
3 
and that U(f, P,) — L(f, P,) can be made as small as desired, by choosing n 
sufficiently large. ‘The same sort of reasoning as before then shows that 

b b3 
0 3) 


This calculation already represents a nontrivial result—the area of the region 


bounded by a parabola is not usually derived in elementary geometry. Never- 
theless, the result was known to Archimedes, who derived it in essentially the same 
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way. The only superiority we can claim 1s that in the next chapter we will discover 
a much simpler way to arrive at this result. 


Some of our investigations can be summarized as follows: 


b 
f=c8o—a if fU)y=—c forall x, 


b b2 az 
f= 7 =a iff () =~ for alll. 

b b3 3 
= ~ a = if f(x) =x? forall x. 


See ais : b : 
Vhis list already reveals that the notation f, f suffers from the lack ofa convenient 
notation for naming functions defined by formulas. For this reason an alternative 
notation,* analogous to the notation hm f(x), 1s also useful: 


Xa 
b b 
f(x)dx means precisely the same as p ih, 
da 


a 


‘Thus 


Notice that, as m the notation lim f(x), the symbol x can be replaced by any 


Ae ee 


other letter (except f, a, or b, of course): 


b b 


b b b 
f@ejdx= / ye) at = / f(a)da= i W)ay = 7 (cyde. 


a a 


The symbol dx has no meaning in isolation, any more than the symbol x —> 
has any meaning, except in the context lim f(x). In the equation 


aCe 


[ bg, Pe oe 
MEX = > Ss 
a 3 3 


* The notation ie f(x) dx is actually the older, and was for many years the only, symbol for the 
integral. Leibniz used this symbol because he considered the integral to be the sum (denoted by f) 
of infinitely many rectangles with height f(«) and “infinitely small’ width dx. Later writers used 
XQ,+++,%n to denote the points of a partition, and abbreviated x; — .xj—, by Ax;. The integral was 
Nn 
defined as the limit as Ax; approaches 0 of the sums ‘ f (xj) Ax; (analogous to lower and upper 
il 
sums). ‘The fact that the hmit is obtained by changing DS tof, f(y) to f(x), and Ax; to dx, delights 
many people. 
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the entire symbol x* dx may be regarded as an abbreviation for: 
the function f such that f(x) = x? for all x. 


This notation for the integral is as flexible as the notation lim f(x). Several ex- 


Peat Gf 


amples may aid in the interpretation of various types of formulas which frequently 
appear; we have made use of Theorems 5 and 6.* 


g Q Pe ae 
(1) (x+ y)dx = i; Kax +/ yox — ae a + y(b—a). 
a . a e a < 
Re oll 
(2) i wtody= | ydy + | td = Sy ae 


b x) b 
(3) / (| (-+nds) CDS (l+r)(x —a)dx 


b 
=f) + | (x —a)dx 


y 


_( Do a | 
=(1l+r) es 

b d b d- C2 

(4) i (| (x + y) ay) ax = / x(d —c) + xls dx 

a C¢ a oe —. 
a dz c2 : b y 
ak eae: (bE a) 4+ (iS @) : XaX 
a d- ¢ b2 a 
aa ao (b—a)+(d—-c) me oD 


The computations of i x dx and ie x? dx may suggest that evaluating integrals 
is generally difficult or impossible. As a matter of fact, the integrals of most func- 
tions ave impossible to determine exactly (although they may be computed to any degree 
of accuracy desired by calculating lower and upper sums). Nevertheless, as we shall see in 
the next chapter, the integral of many functions can be computed very easily. 

Even though most integrals cannot be computed exactly, it is important at least 
to know when a function f zs integrable on [a,b]. Although it is possible to say 
precisely which functions are integrable, the criterion for integrability is a litte too 
difficult to be stated here, and we will have to settle for partial results. The next 
Theorem gives the most useful result, but the proof given here uses material from 
the Appendix to Chapter 8. If you prefer, you can wait until the end of the next 
chapter, when a totally different proof will be given. 


* Lest chaos overtake the reader when consulting other books, equation (1) requires an important 


el . rye ‘ . eb . . ~ . “ 

qualification. ‘This equation interprets f{, ydx to mean the integral of the function f such that cach 
ane : : , ; ob : 

value f(x) 1s the number y. But classical notation often uses y for y(v), so f ydx might mean the 


integral of some arbitrary function y. 
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THEOREM 3 


PROOF 


If f is continuous on [a,b], then f is integrable on [a, b]. 


Notice, first, that f is bounded on [a,b], because it is continuous on [a,b]. To 
prove that f is integrable on [a,b], we want to use Theorem 2, and show that for 
every € > O there is a partition P of [a,b] such that 


Uf i) — LP) <=: 


Now we know, by Theorem | of the Appendix to Chapter 8, that f is uniformly 
continuous on [a,b]. So there is some 6 > O such that for all x and y in [a, b], 


| € 
if |x — y| < 6, then [f(x) — f(y)| < NO) 


The trick is simply to choose a partition P = {to,...,¢:} such that each |t; —14;_-1| < 
6. Then for each 7 we have 


If (x) — fy) < 


—— for all x, y mn [4.4]. 
2(b —a) ; [i-1- 4] 
and it follows easily that 

E é 


M; —-m; = ——_ = —.: 
2b=a) bd =a 


Since this 1s true for all 7, we then have 


SoM =r NE 157) 


il 


U(f, P) —L(f, P) 


A 
S 
| | 
bs) 

Ss 

| 
= 
| 


which is what we wanted. J 


Although this theorem will provide all the information necessary for the use of 
integrals in this book, it is more satisfying to have a somewhat larger supply of 
integrable functions. Several problems treat this question in detail. It will help to 
know the following three theorems, which show that f is integrable on [a,b], if it 
is integrable on [a,c] and [c, b]; that f + g is integrable if f and g are; and that 
c- f is integrable if f is mtegrable and c is any number. 

As a simple application of these theorems, recall that if f 1s 0 except at one 
point, where its value is 1, then f is mtegrable. Multiplying this function by c, it 
follows that the same is true if the value of f at the exceptional point is c. Adding 
such a function to an integrable function, we see that the value of an integrable 
function may be changed arbitrarily at one pomt without destroying imtegrability. 

3y breaking up the mterval into many subintervals, we see that the value can be 
changed at finitely many points. 

The proofs of these theorems usually use the alternative criterion for mtegrability 
in ‘Theorem 2; as some of our previous demonstrations illustrate, the details of the 
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argument often conspire to obscure the poit of the proof. It is a good idea to 
attempt proofs of your own, consulting those given here as a last resort, or as a 
check. This will probably clarify the proofs, and will certainly give good practice 
in the techniques used in some of the problems. 


THEOREM 4 Leta <c <b. If f is integrable on [a,b], then f is integrable on [a,c] and on 
[c. b]. Conversely, if f is integrable on [a,c] and on [c, b], then f is integrable 
on [a,b]. Finally, if f is integrable on [a, b], then 


Prefels 


PROOF — Suppose f is integrable on [a,b]. If ¢ > O, there is a partition P = {to,..., tr} of 
[a,b] such that 
U(f, P)—L(f, P) <e. 


We might as well assume that c = ¢; for some j. (Otherwise, let Q be the partition 


which contains fo,.... t, anc ¢; then O comtains@P, so U(f, O) — LUf,. Q) = 
Of, PL f, Bienes 
INGw AP = {tpden. ¢;} is-a partition of la, c|andi2” — (ie. .m t,} is a partion 
f of [c. 6] (Figure 12). Since 
L(f, P) = Lf, PY ECE P 
Uf PySuGPyPuP), 
re we have 
a= tj =c t= b PU Gf PS ISG ee Pe Cie ee Lifer VI 0 (fe Pye et 
Since each of the terms in brackets is nonnegative, each is less than ¢. ‘This shows 
FIGURE 12 that f is integrable on [a,c] and [c, b]. Note also that 


Ber 3 <|/ f<UUP%, 
a 
iGaeires) = USP’), 


so that 


G b 
Lipp) = f ae i MS OO EE 


a c 


Since this is true for any P, this proves that 


b a 
[oa] f= 


Now suppose that f is integrable on [a, of and-on |je,b). [te = Othere isa 
partiuon P’ of [a,c] and a partition P” of [c, b] such that 


Ce Vie Gh. Pay <a oy 
ign Pa) Sing f, P") < eyZ. 
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THEOREM 5 


PROOF 


If P is the partition of [a,b] containmg all the points of P’ and P”, then 


XG gE Ge AE 18 ee 
CCP) Uf, P ) + tee). 


consequently, 


OCP = ha — Gwe Ne en Cae em 


Theorem 4 is the basis for some minor notational conventions. The integral 


b 23,0 
f{, f was defined only for a < b. We now add the definitions 


a b a 
/ yf = (() and ap = f iit @ se Ib. 
a b 


a 


es ee ; : b b 
With these definitions, the equation aly ft iE fe if f holds for all a,c, b even 
if a <c < bis not true (the proof of this assertion is a rather tedious case-by-case 


check). 


If f and g are integrable on [a,b], then f + g ts integrable on [a, b] and 


b b b 
+e | f+ f a 


Let P ={%, >>>, ibe any partition of ia]. Let 


m; = imi {( f Se oyee) fj =| Sa ee 
HT; = lnteilet aul lie Semcntal 
in; = int (es) Bee ae 


and define M;, M;’, M;” similarly. It is not necessarily true that 
m; =m; +m,;”, 

but it 1s true (Problem 10) that 
m; >m, +m,”. 


Sunilarly, 
M; < M;'+M,". 


Therefore, 
bbfaP) + L(g, P)e<ele (fee) 


and 
OG tg.) SOE) a a ta) 


Thus, 
LU, P) +L i Ge eek P) Uo ee ce) 
Since f and g are integrable, there are partitions P’, P” with 


CORTE LSC ye 2 
COENP YL mP y= eZ. 


THEOREM 6 


PROOF 
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lie? Gontans both Pe” and. P, then 
Gh ae Ue) mney oe aera Graeme. 


and consequently 
U(f+g.P)-—L(f +g, P) <e. 


This proves that f +g is mtegrable on [a,b]. Moreover, 


(1) LCP) i gar) Lf aoe 
b 


< | (f+) 
<U(f +e, P) < Uf?) +UG@, P); 


and also 


b b 
(2) Lf, P) + L(g. rvs | r+ f g =U (fr P) A Ue, 2): 


Since U(f, P) — L(f, P) and U(g, P) — L(g, P) can both be made as small as 
desired, it follows that 


ORG ey ys st WS Mr EN EY of Naa 10 a | 


can also be made as small as desired; it therefore follows from (1) and (2) that 


b b b 
ste= | f+ | oA 


If f is mtegrable on [a,b], then for any number c, the function cf is integrable 


on [a,b] and 
b b 
Mio} 


The proof (which is much easier than that of Theorem 5) is left to you. It is a good 
idea to treat separately the cases c > 0 and c < 0. Why? f 


(Theorem 6 is just a special case of the more general theorem that f - g is 
integrable on [a,b], if f and g are, but this result 1s quite hard to prove (see 


Problem 38).) 


In this chapter we have acquired only one complicated definition, a few simple 
theorems with mtricate proofs, and one theorem which required material from the 
Appendix to Chapter 8. This is not because integrals constitute a more difficult 
topic than derivatives, but because powerful tools developed in previous chapters 
have been allowed to remain dormant. ‘The most significant discovery of calculus 
is the fact that the mtegral and the derivative are mtimately related—once we 
learn the connection, the integral will become as useful as the derivative. and as 
easy to use. The connection between derivatives and integrals deserves a separate 
chapter, but the preparations which we will make in this chapter may serve as a 
hint. We first state a simple inequality concerning integrals, which plays a role in 
many important theorems. 
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FAG RSE aS 


THEOREM 7 


PROOF 


area 


Fict+h)— FC) 


THEOREM 8 


PROOF 


Suppose f is integrable on [a,b] and that 
m < f(x) <M for all x in [a, bd}. 
Then : 
m(b —a) <| f <M(b-a). 


It is clear that 
m(b—a)<L(f,P) and U(Cf, P) < M(b-a) 


for every partition P. Since i f =SsupteG 2 mt) ning desired 
inequality follows immediately. § 


Suppose now that f is integrable on [a,b]. We can define a new function f on 


[a, b| by . | 
FQ) = if fe in (ty ai. 


(This depends on Theorem 4.) We have seen that f may be mtegrable even if it 
is not continuous, and the Problems give examples of integrable functions which 
are quite pathological. The behavior of F is therefore a very pleasant surprise. 


If f is integrable on [a,b] and F is defined on [a,b] by 


F(x) = i ds 


Suppose c is in [a,b]. Since f is integrable on [a, b] it is, by definition, bounded 
on [a, b]; let M be a number such that 


[f(x)| < M for all x in [a, b). 
if h > 0, then (Figure 13) 


cth G cth 
Fit h)— Feo) = | r-f pe) iE 


—M < f(x) <M for all x, 


then F is continuous on [a, b]. 


Since 


it follows from Theorem 7 that 
cth 
—-M-h< / f <Mh; 
in other words, 
(1) —-M-h< F(ct+h)— F(c)<M-h. 


If h <0, a similar imequality can be derived: Note that 


cth c 
Ficth)—- F(d= = -| ie 
Jc cth 
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Applying Theorem 7 to the interval [¢ +h, c], of length —h, we obtain 


Mh < f <—Mh; 


cth 
multiplying by —1, which reverses all the inequalities, we have 
(2) Mh < F(c+h) — F(c) < —Mh. 
Inequalities (1) and (2) can be combined: 
|F(c +h) — F(c)| < M- |Al. 
Therefore, if ¢ > 0, we have 
|\F(c +h) — F(c)| < €, 
provided that |h| < ¢/M. This proves that 
lim F(c +h) = F(e); 


in other words F is continuous at c. J 


Figure 14 compares f and F(x) = J’ f for various functions f; it appears 
that F is always better behaved than f. In the next chapter we will see how true 


this is. 
at 
F f 
/ a —_ 4 


EY 
a a 
/ . 

t — 

a a f a F 
fe | 
—_— 
a 


FIGURE 14 
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PROBLEMS 


1. 


*3. 


de 


ih . ; a , : 
Prove that fy x3dx = b*/4, by considering partitions into 2 equal subin- 
H 


: : 2) : 5 ; ae 
tervals, usmg the formula for ) i” which was found in Problem 2-6. ‘This 
i=] 
problem requires only a straightforward imitation of calculations in the text, 
but you should write it up as a formal proof to make certain that all the fine 
) I 
points of the argument are clear. 


eee |: : Dea aye: 
Prove, similarly, that NG x"dx = BOS 
H 
(a) Using Problem 2-7, show that the sum SS k? /n?*! can be made as close 
k=l 
to 1/(p + 1) as desired, by choosing n large enough. 
b 
(b)) Prove that ji, x? dx — bP*!1(p+]). 


b 

This problem outlines a clever way to find / x? ax for 0 <sde< does (lilie 

a 
result for a = 0 will then follow by continuity.) The trick is to use partitions 
P = {tio}e.2, f2)Mtom which all rafies r = t,/%;2, are*equal; mstead of using 
partitions for which all differences t; — f;-; are equal. 
(a) Show that for such a partition P we have 
b 


forc = —. 
a 


ij =a-cl/" 


(b) If f(x) =+x?, show, using the formula m Problem 2-5, that 


U(f, P) =a =") er 


= 
ie co l/n 
— ¢,Ptl — pptly)(ptl)/n 
= 8 bY ye are 
l 


ee Pi a ee 
(b “ de | SE a lla 0 1) 


and fimd a similar formula for L(f. P). 
(c) Conclude that 


b ; ppt! Bae! 
x? dx = ————___.. 
a ‘Psa | 


(You nught find Problem 5-41 useful.) 


Evaluate without domg any computations: 


] 
(1) i iy ae ee 
= 
] 
(ii) i (0° + 3)/1 — 4x2 dx. 
-! 
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Prove that 


/ WM 
o t+1 


Decide which of the following functions are integrable on [0, 2], and calculate 
the integral when you can. 


for all x > O. 


* M7 Ae x, 0 = ees 
(i) f=) 49 | ep pace 
oy = a x, O<x . 
(11) =) 9 Lae = 2) 


(mi) f(x) =x+ [x]. 
x+ [x], x rational 
0, x irrational. 


iv) 7a) = | 


f(x) = | 1. x of the form a+ b/2 for rational a and b 
‘ 0, x not of this form. 


(ai O<x <= | 
(wi) Fon = =| 

0, % = 0:0r' x > i. 
(vil) f is the function shown in Figure 15. 


FIGURE 15 


Find the areas of the regions bounded by 


> 


(i) the graphs of f(x) = x? and g(x) = a +2. 

(ii) the graphs of f(x) = x? and g(x) = —x? and the vertical lines through 
(—1,0) and (1, 0). 

(iii) the graphs of f(x) = x? and g(x) = 1 — x?. 

(iv) the graphs of f(x) = x? and g(x) = | — x* and A(x) =2. 

(v) the graphs of f(x) = x? and g(x) = x? — 2x + 4 and the vertical axis. 
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10. 


11. 


re. 


13. 


14. 


(vi) the graph of f(x) = Vx, the horizontal axis, and the vertical line 
3 


through (2,0). (Don't try to find if Vx dx; you should see a way of 
guessing the answer, using only integrals that you already know how to 
evaluate. ‘The questions that this example should suggest are consicered 
in Problem 21.) 


Find 


b d 
Hata Fosenrady) dx 


. b d mre . . . . . 
in terms of f{) f and f° g. (This problem is an exercise in notation, with a 


vengeance; it is crucial that you recognize a constant when it appears.) 
Prove, using the notation of Theorem 5, that 
f . ~ 
mj; +m,” = inf { f(x) + e(x2) 2-1 < 41.42 < th} <j. 


(a) Which functions have the property that every lower sum equals every 
upper sum? 

(b) Which functions have the property that some upper sum equals some 
(other) lower sum? 

(c) Which continuous functions have the property that all lower sums are 
equal? 


*(d) Which integrable functions have the property that all lower sums are 


equal? (Bear in mind that one such function is f (+) = 0 for x irrational, 
J) S77 for x =" p/¢ m lowest terms.) “—Fint: You will need! the 
notion of a dense set, introduced in Problem 8-6, as well as the results of 


Problem 30. 


Ifa <b<c<d and f is integrable on [a, d], prove that f is integrable on 
[b,c]. (Don’t work hard.) 


(a) Prove that if f is integrable on [a,b] and f(x) = 0 for all x mn [a,b], 


b 
then tee Ol 
a 
(b) Prove that if f and g are integrable on [a,b] and f(x«) = g(x) for all x 
b bh 
in |a, b], then f=] g. (By now it should be unnecessary to warn 


a a 
that if you work hard on part (b) you are wasting time.) 


Prove that 
b b+e 
Ji@dx = f(x -—e)dx. 
a ag 
(The geometric interpretation should make this very plausible.) Hint: Every 
partition "P= {igs nt, Pot [t'b] “gives rise fo a partihen P = ip +c; 
..soh 4+ ¢} of [a+ e¢,5 +c], and conversély, 


#15. 


=16. 


| Pg 


18. 


WE 


20. 
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For a,b > 1 prove that 


a ] b ] ab ] 
| var + f var = | — dt. 
Ib wh ; ¢ i 8 


ab 
| 1/tdt. Every partition P = 
b 


a 
Hit: This can be written | l/tdt 
| 


{%, . weap fofelilvalm@awesenise to a parition P&= (big, ..., bt,} of [b, ab], 


and conversely. 
ch b 
/ ¥@Q)d4= 2) f(et)dt. 
ca a 


(Notice that Problem 15 1s a special case.) 


Prove that 


Given that the area enclosed by the unit circle, described by the equation 
x? 4 y? = 1, is 2, use Problem 16 to show that the area enclosed by the 
ellipse described by the equation x7/a? + y?/b? = 1 is wab. 


b 
This problem outlines yet another way to compute / x" dx; it was used by 
a 


Cavalieri, one of the mathematicians working just before the invention of 
calculus. 


l a 
(a) etc = | x" dx. Use Problem 16 to show that i x"dx =c,a"*!, 
0 0 


(b) Problem 14 shows that 


2a a 
| Take Ea / (x +a)" dx. 
0 —a 


Use this formula to prove that 


Nl 
oN eas | = Iqttl S Qn 


k even 
(c) Now use Problem 2-3 to prove that c, = 1/(n + 1). 


Suppose that f is bounded on [a,b] and that f is continuous at each point 
in [a,b] with the exception of x9 in (a,b). Prove that f is mtegrable on 
[a,b]. Hint: Imitate one of the examples in the text. 


Suppose that f is nondecreasing on [a,b]. Notice that f is automatically 
bounded on [a,b], because f(a) < f(x) < f(b) for x in [a, b}. 


Qjalf 2S igee.-(,) sa partition of [a,b], whatnsaGee anc "om )? 

(b)_.ouppese thats, — 1,1; = 6 for each 4. ProvethamG (7, P) — Lf, P) = 
s[ f(b) — Fla)}. 

(c) Prove that f is integrable. 

(d) Give an example of a nondecreasing function on [Q, 1] which is discon- 
tinuous at infmitely many poimts. 
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total area b- f-!(b) 


fae) 


o2 ls 
@ 
a b 
area a- ja (a) 
FIGURE 16 
iis) 
ii 113) 
22. 


h Mp) 


Pa 
J '(ty) - 


It might be of interest to compare this problem with the following extract 
from Newton’s Prncipra.* 


LEMMA IT 


Lf in any figure Aack, terminated by the night hnes Na, AE, and the curve ack, 
there be inscribed any number of parallelograms Ab, Be, Gd, &c., comprehended 
under equal bases AB, BC, CD, &c., and the sides, Bb, Gc, Dd, &e., parallel 
to one side Aa of the figure; and the parallelograms alX\bl, bLem, cMdn, @c., are 
conipleted: then if the breadth of those parallelograms be supposed to be diminished, 
and their number to be augmented in infinitum, J say, that the ultimate ratios 
which the mscribed figure AKbLCMdD, the circumscribed figure Aalbmendok, 
and curvihnear figure AabcdE, will have to one another, are ratios of equahty. 

For the difference of the inscribed and circumscribed figures 1s the 
sum of the parallelograms Kl, Lm, Mn, Do, that is (from the equality 
of all their bases), the rectangle under one of their bases Kb and the 
sum of their altitudes Aa, that is, the rectangle ABla. But this rectangle, 
because its breadth AB is supposed diminished i infinitum, becomes less 
than any given space. And therefore (by Lem. 1) the figures inscribed 
and circumscribed become ultimately equal one to the other; and much 
more will the intermediate curvilinear figure be ultimately equal to either. 


O.E.D. 


Suppose that f is increasing. Figure 16 suggests that 


b fab) 
(ee a a i i 
a f(a) 
(a) If wij... tn} 1s a partition off 0p) lGue aay eee 


P 
Semel ie Prove that, as suggested in Figure 17, 
Loe... Py+UCf.P)=bf' (Oetafe “@)| 


(b) Now prove the formula stated above. 
b 


(c) Find Veeoetor 0 =a <b. 

a 
Suppose that f is a continuous increasing function with f(0) = 0. Prove 
that for a,b > Owe have loung’s inequahty, 


a b 
ab < [ fayax = | af ear 
0 0 


and that equality holds if and only if b = f(a). Hint: Draw a picture like 
Figure 16! 


* Newton's Principia, A Revision of Mott’s Translation, by Florian Cajori. University of California 
FIGURE 17 Press, Berkeley, California, 1946. 


233 
GORE, 13 24. 
f 
* 
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FIGURE 19 
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(a) Prove that if f is integrable on [a,b] and m < f(x) < M for all x in 
[a,b], then 


b 
/ f(xjdx = (b-—a)p 


a 


for some number jz with m < up < M. 
(b) Prove that if f is continuous on [a, b], then 


b 


Os) dae "(Dea aie) 


a 


for some & in [a,b]. 
(c) Show by an example that continuity is essential. 


(d) More generally, suppose that f is continuous on [a,b] and that g is 
integrable and nonnegative on [a,b]. Prove that 


b b 
[ feoecnax = re | e(x) dx 


for some € in [a,b]. ‘This result is called the Mean Value Theorem for 
Integrals. 


(ec) Deduce the same result if g 1s integrable and nonpositive on [a, b]. 


(f) Show that one of these two hypotheses for g is essential. 


In this problem we consider the graph of a function in polar coordinates 
(Chapter 4, Appendix 3). Figure 18 shows a sector of a circle, with central 
; Now 


consider the region A shown in Figure 19, where the curve is the graph in 
polar coordinates of the continuous function f. Show that 


. . . . . 9 
angle 6. When @ is measured in radians, the area of this sector is r- - 


Ww| D2 


ay . 
area A = = J (@) ae. 
4 
~ Je 
Let f be a continuous function on [a,b]. If P = {to..... t,} is a partition of 


[a, b|, define 


HW 


Li face Be 


i=] 


= 9) 


(anaes ie} Ge) 
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The number €(f, P) represents the length of a polygonal curve inscribed in 
the graph of f (sec Figure 20). We define the length of f on [a,b] to be 


Z the least upper bound of all €(f, P) for all partitons P (provided that the sct 
7 i of all such €(f, P) is bounded above). 


(a) If f is a lnear function on [a,b], prove that the length of f is the 
distance from (a, f(a)) to (b, f(b)). 


—_ 2s (b) I f is not hnear, prove that there is a partition P = {a,t, b} of [a,b] 
= hn #8) a ty =b such that €(f, P) is greater than the distance from (a, f(a)) to (b, f(b)). 
(You will need Problem 4-9.) 
oe (c) Conclude that of all functions f on [a,b] with f(a) =c and f(b) =d, 


the length of the Imear function is less than the length of any other. (Or, 
in conventional but hopelessly muddled terminology: “A straight linc is 
the shortest distance between two points”.) 

(d) Suppose that f’ is bounded on [a,b]. If P is any partition of [a,b] 
show that 


L(V EG oe P) =< 0 Py suv 1 +f) ap). 
Hint: Use the Mean Value ‘Vheorem. 
(e) Why ts sup{ L(V Peet), Py} < sup{€(f, P)}? (his is casy.) 
(f) Now show that sup{é(f. P)} < inf {U(V1 + tow Py, thereby proving 


b 
that the length of f on [a,b] is / ATEN a git ee (Gi is Inte- 
grable on [a,b]. Hint: It suffices to show that if P’ and P” are any two 
partitions, then €(f, P’) < U(v 1+ Ge Pp”). If P contains the points 
of both P’ and P”, how does €(f, P’) compare to €(f, P)? 

(2) Let £(v) be the length of the graph of f on [a,x], and let d(x) be the 
Iength of the straight line segment from (a, f(a)) to (x, f(x)). Show 
that if V1 + (f')? is integrable on [a,b] and f’ is continuous at a (ic. 
Y hn ¥ G7 (@silien 

x—at 


Pix 
oe 


1m 
x>at d(x) 


Hint: It will help to use a couple of Mean Valuc ‘Theorems. 


(h 


In Figure 21, the part of the graph of f between 5 and 5 is just half the 


size of the part between t and 1, the part between 5 and : is Just half 
the size of the part between j and 4, etc. Show that the conclusion of 
FIGURE 21 part (g) does not hold for this f. 


26. A function s defined on [a, b] is called a step function if there is a partition 
n= {lye tn} of [a,b] such that s is a constant on each (¢;—1. t;) (the valucs 
of s at t; may be arbitrary). 


(a) Prove that if f is integrable on [a,b], then for any ¢ > 0 there 1s a step 


rat: 


28. 


29; 


*30. 


43s 
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b b 
function s; < f with | cA | 5, < €, and also a step function s2 > f 
a 
b b 
with s2—- GeO 


a a 
(b) Suppose that for all e¢ > 0 there are step functions s; < f and sz > f 


b b 
such that / 52 — i s, <&. Prove that f is integrable. 
a a 
b 
(c) Finda function f which is not a step function, but which satisfies = 
L(f. P) for some partition P of [a,b]. 


Prove that if f is integrable on [a, b], then for any ¢ > O there are continuous 
b 

functions g < f < h with | h -| g <e. Hint: First get step functions 

with this property, and then continuous ones. A picture will help immensely. 

(a) Show that if sy and s2 are step functions on [a,b], then s; + s2 1s also. 


b b b 
(b) Prove, without using Theorem 5, that (sj + 52) =| S| +f 


(c) Use part (b) (and Problem 26) to give an ‘alternative proof of Theorem 5. 


Suppose that f 1s Eaten on [a,b]. Prove that there is a number x in 
b 


[a,b] such that —_ = f. Show by example that it is not always possible 
to choose x to be. in (a, b). 


The purpose of this problem is to show that if f is integrable on [a, b], then 
f must be continuous at many points in [a, 5]. 


(a) Leal? Sie t,} be a partition of [a,b] with UCf. P) — L(f, P) < 
b —a. Prove that for some i we have M; — m; < 1. 

(b) Prove that there are numbers a; and bj with a < a; < hy < b and 
Sup (Ae) = peer yt f Ce) 2 ay Se = Db) <etl eow cantaheose 
[a1, b1| = [t4)-1. «| from part (a) unless / = | or n; and in these two cases 
a very simple device solves the problem.) 

(c) Prove that there are numbers ay and b2 with a; < a2 < bo < by and 
sup{ f(x) :a2 <x < bo} —inf{ f(x) ia <x < by) < 5. 

(d) Continue in this way to find a sequence of intervals J, = [dn, bn] such 
that supi icy eat) — inf { f(x): x in 7,} < Pam Apply the "Nested 
Intervals Theorem (Problem 8-14) to find a point x at which f is con- 
tinuous. 

(ec) Prove that f is continuous at infinitely many points in [a, b]. 


b 
Let f be integrable on [a,b]. Recall, from Problem 13, that / f= Ut 
f(x) = 0 for all x in [a, 5]. _ 


(a) Give an example where f(x) > 0 for all x, and f(v) > 0 for some x in 
b 
[a,b], and ane OU. 


a 
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792. 


Jo: 


-Oa. 


pahee 


*36. 


ois 


(b) Suppose f(x) > O for all x in [a,b] and f is continuous at xo in [a, b] 
b 


and f (x9) > 0. Prove that f > 0. Hint: It suffices to find one lower 


sum LC, Pyewhicli4s positive. 
(c) SUP POs f is integrable on [a,b] and f(x) > 0 for all x in [a, b]. Prove 


that [ f > 0. Hint: You will need Problem 30; indeed that was one 


reason for including Problem 30. 


b 
(a) Suppose that f is continuous on [a, b] and fg =0 for all continuous 


é r a . . . 
functions g on [a,b]. Prove that f = 0. (This is easy; there is an obvious 
g to choose.) 


(b) Suppose f is continuous on [a,b] and that fg = 0 for those con- 


tinuous functions g on [a,b] which satisfy the extra conditions gia) 
g(b) = 0. Prove that f = 0. (This innocent looking fact is an important 
lemma in the calculus of variations; see reference [22] of the Suggested 
Reading.) Hint: Derive a contradiction from the assumption f (x9) > 0 
or f (x9) < 0; the g you pick will depend on the behavior of f near xo. 


ket (Co) i= » forsetratonaliand (() =!0"for x irrational: 
(a) Compute L(f, P) for all partitons P of [0,1]. 
(b) Find inf{U(f, P): P a partition of [0, 1)}. 


Let f(x) = O for irrational x, and 1/q if x = p/q in lowest terms. Show 


that f is integrable on [0,1] and that / f = 0. (Every lower sum is clearly 
; 0 
0; you must figure out how to make upper sums small.) 


Find two functions f and g which are integrable, but whose composition 
§ = 3 
go f isnot. Hint: Problem 34 is relevant. 


Let f be a bounded function on [a,b] and let P be a partition of [a, 5]. 
Let M; and m; have their usual meanings, and let M;’ and m;’ have the 
corresponding meanings for the function | f}. 


a) Prove that Mj’ —m;’ < M; —imj,. 

b) Prove that if f is integrable on [a, b], then so is |f]. 

c) Prove that if f and g are integrable on [a,b], then so are max(f. g) and 
min(f. g). 

‘d) Prove that f is integrable on [a,b] if and only if its “positive part” 
max(f.0) and its “negative part” min(f, 0) are integrable on [a. 5]. 


Prove that if f is integrable on [a, b], then 


b 
Z / ilar. 


Hint: ‘Vis follows easily from a certain string of inequalities; Problem 1-14 


b 


f(dt 


a 


is relevant. 


*38. 


39. 


*40. 
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Suppose f and g are integrable on [a,b] and f(x), g(x) = O for all x in 
[a,b]. Let P be a partition of [a,b]. Let Mj’ and m;’ denote the appropriate 
sup’s and inf’s for f, define M;” and m;” similarly for g, and define M; and mj; 
similarly for fg. 

(a) Prove that M; < M,'M;" and m; > m,’m;". 

(b) Show that 


n 


U(fg. P) — L(fg. P) < > [Mi Mi" — mj’mj"\(ti — 1). 
ll 
(c) Using the fact that f and g are bounded, so that | f(x)|, |g(«)| < M for 
x in [a, b], show that 


U(fg,P) = Lua 


< M [um = m; | (ti — cy) + y > [Mi" = mi \ (ti - 1-0] 3 
i 


i=] 
(d) Prove that fg is integrable. 
(e) Now eliminate the restriction that f(x), g(x) > 0 for x in [a,b]. 


Suppose that f and g are integrable on [a,b]. The Cauchy-Schwarz inequality 


states that , > P b 
() < )(L*) 


(a) Show that the Schwarz inequality is a special case of the Cauchy-Schwarz 
inequality. 

(b) Give three proofs of the Gauchy-Schwarz inequality by imitating the 
proofs of the Schwarz inequality in Problem 2-21. (The last one will 
take some imagination.) 

(c) If equality holds, is it necessarily true that f = Ag for some A? What if 
f and g are continuous? 


ey? 1 
(d) Prove that (| i) = (/ @) Is this resulttruc if_Osand | axe 
0 0 


replaced by a and b? 


Suppose that f is integrable on [0, x] for all x > Oand hm f(x) =a. Prove 


that 
1 x 

lim -| f(t) da=a. 

30 10,8) DE 0 
Hint: The condition hm f(x) = a@ implies that f(t) is close to @ for 

xO) 

N+M 

t > some N. This means that f(t) dt is close to Ma, If M is large in 


N 
comparison to N, then Ma/(N + M) 1s close to a. 
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FIGURE 1 
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APPENDIX. RIEMANN SUMS 


Suppose that P = {fo,....f,} 1s a partition of [a,b], and that for each i we 
choose some point x; in |f;-1, ¢;]. Then we clearly have 


H 
i Fe f ONG, he Oe Py, 
Nn i=l 
Any sum 2 f(x) (ti — G1) 1s called a Riemann sum of f for P. Figure | shows 
the Pee interpretation of a Riemann sum; it is the total area of n rectangles 
that he partly below the graph of f and partly above it. Because of the arbitrary 
way in which the heights of the rectangles have been picked, we can’t say for 


sure whether a particular Riemann sum is less than or greater than the mtegral 
b 


f(x)dx. But it does seem that the overlaps shouldn’t matter too much; if the 


a 
bases of all the rectangles are narrow enough, then the Riemann sum ought to be 
close to the mtegral. ‘The followmg theorem states this precisely. 


Suppose that f is integrable on [a,b]. Then for every ¢ > 0 there is some 6 > 0 
such that, if P = {f,...,¢,} is any partition of [a, b] with all lengths t; —t;_1 <6, 
then 


i b 
FOG in jomsh ace fan (aCe reat. 


i=l 


for any Riemann sum formed by choosing x; in [f;—1. t]. 


Since the Riemann sum and the integral both he between L(f, P) and U(f. P), 
this amounts to showing that for any given € we can make U(f, P) — L(f, P) < 
by choosing a 6 such that U(f, P) — L(f, P) < € for any partition with all lengths 
i, — fon 

The definiton of f being integrable on [a,b] includes the condition that | f] < 
M for some M. First choose some particular partition P* = {uo...... ux} for 
which 


CPAP* \x— do(efinP* Ve xe/2. 


and then choose a 6 such that 


ee ek 
* 4MK 


For any partition P with all ¢ — 4-1 < 46, we can break the sum 


H 
U(f, P) — Lf, P) = (Mi — mi) (4 = 41-1) 

i=l 
into two sums. The first involves those i for which the interval [t;—1, | 1s com- 
pletely contained within one of the intervals [wj;—1.u;]. ‘This sum is clearly < 
UC, P™) — E(ffP) = e7 For all ot lttr (we willthave 7) <4, = & for 
Solre y(n lees. K — 1, so there are at most K — 1 of them. Consequently, the 
sum for these terms is < (K — 1)-2M-6 < ¢/2.J 


(u(ty), v(t1)) 


HG RIE 2 


(u(to), v(to)) 
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The moral of this tale is that anything which looks like a good approximation 


to an mtegral really is, provided that all the lengths ¢; — t;-; of the intervals in the 


partition are small enough. Some of the following problems should brmg home 


this message with even greater force. 


PROBLEMS 


Suppose that f and g are continuous functions on [a,b]. For a partition 
P= {fo,.-2-, 4,208 [ab] choosea set of pomis x; in [7;_1,7;|| and anothemset 
of points uw; m [f;-1, ti]. Consider the sum 


3 Fig Gi) (i — 4-1). 
i 


Notice that this 1s not a Riemann sum of fg for P. Nevertheless, show that 
b 


all such sums will be within ¢ of fg provided that the parttion P has all 


a 
lengths ¢; — 4;-; small enough. Hint: Estimate the difference between such a 
sum and a Riemann sum; you will need to use uniform continuity (Chapter 
8, Appendix). 


This problem is similar to, but somewhat harder than, the previous one. 
Suppose that f and g are contmuous nonnegative functions on [a,b]. Fora 
partition P, consider sums 


= Vv fi) + eG) G — G-1). 


i=] 
b 
Show that these sums will be within ¢ of Jf +g wall t; —4_) are small 


a 
enough. Hint: Use the fact that the square-root function is uniformly con- 
tinuous on a closed interval [0, /]. 


Finally, we're ready to tackle something big! (Compare Problem 13-25.) 
Consider a curve ¢ given parametrically by two functions uw and v on fa, b]. 
For a partition P = {to,...,t,} of [a,b] we define 


[e(t)) — u(j_1)]? + [v@) — v@_1)]?; 


i 


this represents the length of an mscribed polygonal curve (Figure 2). We 
define the length of ¢ to be the least upper bound of all €(f, P), if it exists. 
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Prove that if uw’ and v’ are continuous on [a, b], then the length of ¢ is 
u’> + v’?, 


Let f’ be continuous on the interval [4,6)]. Show that the graph of f in 
polar coordinates on this interval has the length 


% 


Using Theorem |, show that the Cauchy-Schwarz inequality (Problem 13-39) 
is a consequence of the Schwarz inequality. 


CHAPTER 


THEOREM 1 (THE FIRST 
FUNDAMENTAL THEOREM 
OF CALCULUS) 


PROOF 


THE FUNDAMENTAL THEOREM 
OF CALCULUS 


From the hints given in the previous chapter you may have already guessed the 
first theorem of this chapter. We know that if f is integrable, then F(x) = [ i ns 
continuous; it is only fitting that we ask what happens when the original function f 
is continuous. It turns out that F is differentiable (and its derivative is especially 
simple). 


Let f be integrable on [a,b], and define F on [a, b] by 


Faye ip f. 


If f is continuous at c in [a, bj, then F is differentiable at c, and 
F(c) = f(@)- 


Uf ¢ =a or b, then F’(c) is understood to mean the right- or left-hand derivative 
ol 7.) 


We will assume that c is in (a,b); the easy modifications for ¢c = a or b may be 
supphed by the reader. By definition, 


F h) — Fic 
F'(c) = lim (c+ x a 
1— 1 


Suppose first that h > 0. Then 
cth 
Ficthy- F=f be 
Define m, and M,, as follows (Figure 1): 


Vee it ty Ge) 
Vie SsuD (0). « 


< X 
tat): 


It follows from ‘Theorem 13-7 that 


cth 
mp-h< / Ff <M, -h. 
fs 


‘Therefore 


Fi(c +h) — F(c 
Mp < Peppy : () = My). 
1 


If h <0, only a few details of the argument have to be changed. Let 


ay =iml{ f(x) C42 = x =e); 
Mi SUD (AC Ca i Se =). 
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c c+h 


FIGURE 1 
Then 
c 
mp > (—h) < il f <= M,-(-h). 
ct+h 
Since 


ct+h c 
Fic+h)— Fe) = | s=-f f 
c cth 


m,-h> F(c+h)— F(c) > M,-h. 


this yields 


Since h < 0, dividing by / reverses the inequality again, yielding the same result 
as before: 
F(e-+-h)— Fe) 
mys = h- 
h 
This inequality is true for any integrable function, continuous or not. Since f is 
continuous at c, however, 


lim my, = lim M,, = f(c), 
h—-O h—-O 


and this proves that 
Pie +h): —F(e) 


F’(c) = lm —————- =_ f(c): J 
h—0 h 


Although Theorem | deals only with the function obtained by varying the upper 
limit of integration, a simple trick shows what happens when the lower limit is 


oh 
G(x) = | 3 


varied. If G is defined by 


COROLLARY 


PROOF 
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then 
b x 
aw=for- fs 
a a 
Consequently, if f is continuous at c, then 
Gi(c) =—f(c). 


The minus sign appearing here is very fortunate, and allows us to extend Theo- 
rem | to the situation where the function 


F(x) = i f 


is defined even for x <a. In this case we can write 
a 
F(x) = — / ie 
aXe 


F (Gee f (c)) = f(c), 


so if ¢c < a we have 


exactly as before. 

Notice that in either case, differentiability of F at c is ensured by continuity of f 
at c alone. Nevertheless, Theorem | is most interesting when f is continuous at 
all points in [a,b]. In this case F is differentiable at all points in [a,b] and 


F=f. 


In general, it 1s extremely difficult to decide whether a given function f is the 
derivative of some other function; for this reason ‘Theorem 11-7 and 
Problems 11-60 and 11-61 are particularly interesting, since they reveal certain 
properties which f must have. If f is continuous, however, there is no problem 
at all-—according to Theorem |, f zs the derivative of some function, namely the 


F(x) = / &. 


Theorem | has a simple corollary which frequently reduces computations of 
integrals to a triviality. 


function 


If f is continuous on [a,b] and f = g’ for some function g, then 


b 
f = gb) — g(a). 


ies) = ip 3 


Then F’ = f = g’ on [a,b]. Consequently, there is a number c such that 


Let 


fae a0 C, 
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The number ¢ can be evaluated easily: note that 
0 = F(a) = g(a) +c, 


so c = —g(a); thus 


F(x) = g(x) — g(a). 


This is true, in particular, for x = b. ‘Thus 


b 
f = F(b) = g(b) — g(a). I 


ad 


The proofof this corollary tends, at first sight, to make the corollary secm useless: 
after all, what good is it to know that 


b 
f =2(b) — g(a) 


a 
‘ - ° Be am 5 ‘. : 
if g is, for example, g(x) = ff ? The point, of course, is that one might happen 
to know a quite different function g with this property. For example, if 
x? 
8) 


then g/(x) = f(x) so we obtain, without ever computing lower and upper sums: 


[ Cae b° wap 
: Xx sie 3 


One can treat other powers similarly; if # is a natural number and g(x) = 
x't+1 /(n + 1), then g’(x) = x",'so 


b i n+] n+l 
2. a 
i i — f 
a 


oe) = and”) f@)"= oa 


n+ |] Re | 


For any natural number n, the function f(x) = x~" is not bounded on any interval 
containing 0, but if a and b are both positive or both negative, then 


b / —n+t+l —n+] 
y) a 
i ras dx SS. i 
2 —n+1 -n+1 


Naturally this formula is only true for n 4 —1. Je do not know a simple expression for 


| b 1 
—dx. 
a * 


The problem of computing this integral is discussed later, but it provides a good 
opportunity to warn against a serious error. ‘Vhe conclusion of Corollary | 1s often 
confused with the definition of integrals——many students think that ihe f is defined 
as: “g(b) — g(a), where g is a function whose derivative is f.” This “definition” is 
not only wrong it is useless. One reason is that a function f may be integrable 
without being the derivative of another fimetion. For example, if f(x) = 0 for 
x #tand f(l) = 1, then f is mtegrable, but f cannot be a deriwative (why not?). 
Vhere is also another reason that is much more important: If f is continuous, 


THEOREM 2 (THE SECOND FUNDA- 
MENTAL THEOREM OF CALCULUS) 


PROOF 
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then we know that f = g’ for some function g; but we know this only because of 
Theorem 1. Vhe function f(x) = 1/x provides an excellent illustration: if x > 0, 


then f@) =-2'(s jaawhene 
| 
g(x) =| — dt, 
ime: 


and we know of no simpler function g with this property. 

The corollary to Theorem 1 is so useful that it is frequently called the Second 
Fundamental ‘Theorem of Calculus. In this book, that name is reserved for a 
somewhat stronger result (which in practice, however, is not much more useful). 
As we have just mentioned, a function f might be of the form g’ even if f is not 
continuous. If f is integrable, then it is still true that 


b 
f = g(b) — g(a). 


a 
The proof, however, must be entirely different—we cannot use Theorem 1, so we 
must return to the definition of integrals. 


If f is integrable on [a,b] and f = g’ for some function g, then 
b 


f = g(b) - g(a). 


a 


Let P = {to,....t,} be any partition of [a,b]. By the Mean Value Theorem there 
is a point x; m [f;-1, 4] such that 


g(ti) — g(4-1) = 2’ (DG — H-1) 
=f Oi Gals 


If 


hig = Init (0). eT | 
M; =sup{f(x): 4-1 <x < i}, 


then clearly 
mi(t; —% 1) [OGG S11) = 1) 


that is, 
Pibi (Ls alg eT) On 


Adding these equations for 7 = 1,...,# we obtain 


iW n 


> mili — 4-1) < g(b) — g(a) < Do Milt — 4-1) 


i=l i=l 
so that 


Ly Py = ab) = vay = UG, 2) 


for every partition P. But this means that 
b 


g(b)—g(av=] fl 


a 
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We have already used the corollary to Theorem | (or, equivalently, ‘Theorem 2) 
to find the integrals of a few elementary functions: 


E ee 
., Coe git! (a and b both positive or 
ae 
a 


mele +1" * alee both negative if n > 0). 


As we pointed out in Chapter 13, this integral does not always represent the area 
bounded by the graph of the function, the horizontal axis, and the vertical lines 
through (a, 0) and (6,0). For example, if a < 0 < b, then 


b 
/ xodx 
a 


does not represent the area of the region shown in Figure 2, which is given 
mstead by 


° oe M 34 0* at b+ =o 
-(f3 ax +f 8ax=- eer i Fim pr 
x ie iy 
7 ai a 
FIGURE 2 Similar care must be exercised in finding the areas of regions which are bounded 


by the graphs of more than one function—a problem which may frequently involve 
considerable ingenuity in any case. Suppose, to take a simple example first, that 
we wish to find the area of the region, shown in Figure 3, between the graphs of 
the functions 


f(x)=x? and gay=x 


on the interval [0,1]. If 0 < x < 1, then 0 < x? < x’, so that the graph of g hes 


below that of f(. The area of the region of interest to us 1s therefore 


area" Rf, 0, 1)— area R(g. 0, 1), 


Tl i 
Pie : Say ieee’ 
/ Xx ax— f Xx dx =3-— 4= 73: 
0 0 


‘This area could have been expressed as 


b 
/ Gite): 


If g(x) < f(x) for all x in [a, b], then this integral always gives the area bounded 


which is 


by f and g, even if f and g are sometimes negative. ‘The easiest way to see this is shown 
in Figure 4. If ¢ 1s a number such that f +c aud g +c are nonnegative on [a,b], 
FIGURE 3 then the region R;, bounded by f and g, has the same area as the region Ro, 


FIGURE 4 


fe 


eae 


(b) 


& |--- 
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bounded by f +c and g +c. Consequently, 


b b 
aréa ky = areal = sto- | (g+c) 
a 


a 


b 
= pe ne ae 


b 


= | (f-g). 


a 


This observation is useful in the following problem: Find the area of the regron 
bounded by the graphs of 


3 ez 
fQyex 2° “and. ex) x-. 
The first necessity is to determine this region more precisely. The graphs of f 
and g intersect when 


5 
x? — of Sah 


or x? —x?-x=0, 
or x(x7-—x—-—1)=0, 
Pew le¥5 


2 2 


— 


or =a 0 


@nethe interval ({h = J5 ]/2,.0)mwes hage x3 — x > x? and on the interval 
(0. [1 + V5]/2) we have x2 > x3 — x. These assertions are apparent from the 
graphs (Figure 5), but they can also be checked easily, as follows. Since f(x) = g(x) 
anlar x< — 0) /5 |/2, or [1 - “51/2, the function / — g does not change sign 


on the intervals ([{ 1 — 52. QO) and (O, [1+ V5 ]/2): it is therefore only necessary 
to observe, for example, that 


to conclude that 
f—2= 05 om (ll —W/5]/2, 0). 
fo e=0 Ton 0, (1 45)/2. 
The area of the region in question is thus 
0 _ 
i (ao = re) dc +/ = ‘eS Dee 


As this example reveals, one of the major problems involved in finding the areas 
of a region may be the exact determimation of the region. There are, however, 
more substantial problems of a logical nature — we have thus far defined the areas 
of some very special regions only, which do not even include some of the regions 
whose areas have just been computed! We have simply assumed that area made 


to 
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FIGURE 5 


sense for these regions, and that certain reasonable properties of “area” do hold. 
These remarks are not meant to suggest that you should regard exercising ingenuity 
to compute areas as beneath you, but are meant to indicate that a better approach 
to the definition of area is available, although its proper place is somewhere in 
advanced calculus. The desire to define area was the motivation, both in this 
book and historically, for the definition of the integral, but the integral does not 
really provide the best method of defining areas, although it is frequently the proper 
tool for computing them. 


It may be discouraging to learn that integrals are not suitable for the very pur- 
pose for which they were invented, but we will soon see how essential they are for 
other purposes. The most important use of integrals has already been emphasized: 
if f is continuous, the integral provides a function y such that 


y' (x) =o @). 


This equation is the simplest example of a “differential equation” (an equation 
for a function y which involves derivatives of y). The Fundamental Theorem 
of Calculus says that this differential equation has a solution, if f is continuous. 
In succeeding chapters, and in various problems, we will solve more complicated 
equations, but the solution almost always depends somehow on the integral; in 
order to solve a differential equation it is necessary to construct a new function, 
and the integral is one of the best ways of doing this. 

Since the differentiable functions provided by the Fundamental ‘Theorem of 
Calculus will play such a prominent role in later work, it is very important to 
realize that these functions may be combined, like less esoteric functions, to yield 
still more functions, whose derivatives can be found by the Chain Rule. 

Suppose, for example, that 


as 


x | 
Tea) — / ae 
a I!+sin*t 
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Although the notation tends to disguise the fact somewhat, f is the composition 
of the functions 


x ] 
C(x)=x* and F(x) = / ———— ah. 
a 1+sin*t 


In fact, f(x) = F(C(x)); in other words, f = F oC. Therefore, by the Chain 
Rule, 

f'@) =F'(C@)) 6 &) 
F'(x3) «3x? 


I > 
— ——~— 3x". 
Pye, 3 
1 + sin* x- 


If f is defined, instead, as 


a ] 
fan) = | —— dt, 
3 1+sin“t 


then 


] 
f'(«~) = -— >: 3x’. 
1 + sin* x3 


If f is defined as the reverse composition, 


3 


Xx | 3 
ror=(f ——- ar) . 
a |1+sin*t 
f'@)=C(FQ))- F’@) 


fe) es! 
Pie 
a 1+sin‘t 1 + sin“ x 


sin x | 
f(x) = / oe dt, 
a 1+ sin“f 


a | 
g(x) = i 0, 
sinx | + sin“t 


x ] 
h(x) = sin (/ —$<$— ar) ; 
a 1+sin*t 


| 
1+ sin? (sin x) 
—|] 


1+ sin7(sin x) 


- = l l 
h (x) = cos — dt ET 
a |1+sin“t 1 + sin* x 


then 


Similarly, if 


then 


foc 


*-COSX, 


COS X, 


B(x\= 
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The formidable appearing function 


yx) fhe an) a, 


ee 
a ] + sin“ t 


is also a composition; m fact, f = F o F. Therefore 


f Oia (PF (x)) FG) 
] ] 


D) _ | , cas 
1 + sin* (| —- ar) I+ sin” x 
a |+sin“t 


As these examples reveal, the expression occurring above (or below) the integral 
sign indicates the function which will appear on the mght when f is written as a 
composition. As a final example, consider the triple compositions 


Ce a dt) 
a | -tsins ¢ 


1 


3 
(/. + 2, ir) ] Peet ole dt ] 
Leo Ht ite ———dt,_— g(x)= i Ja isin ———— dt. 


nao o 
a cia f a = sie 6 


which can be written 
fee aC And wee = aheoumar, 


Omitting the mtermediate steps (which you may supply, if you still feel msecure), 
we obtain 


; l l 4 
on — ——SE ——— 
pa pal I | +sin" x3 
ls sige / SS 
a 1+sin"t 
Kx) l l 
Ol —— 
. 7 x l 
- (/. tre] | Sem (| ar) 
I+ sin aa a 1+sm*t 
a ] + sin” ¢ 
l 
I +sin?x 


Like the simpler differentiations of Chapter 10, these manipulations should be- 
come much easier after the practice provided by some of the problems, and, like 
the problems of Chapter 10, these differentiations are simply a test of your under- 
standing of the Chai Rule, in the somewhat unfamiliar context provided by the 
Fundamental Theorem of Calculus. 

The powerful uses to which the integral will be put in the following chapters 
all depend on the Fundamental ‘Vheorem of Calculus, yet the proof of that the- 
orem was quite easy— it seems that all the real work went ito the definition of 
the integral. Actually, this is not quite true. In order to apply ‘Vheorem | to a 
continuous function we need to know that if f is conunuous on [a,b], then f is 
mtegrable on [a, b]. Although we've already offered one proof of this result, there 
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is a more elementary argument that you might prefer. Like most “elementary” 
arguments, it’s quite tricky, but it has the virtue that it will force a review of the 
proof of Theorem 1. 


If f is any bounded function on [a, b], then 


sup{l@, P)) and mitUucs, P)} 


will both exist, even if f is not integrable. ‘These numbers are called the lower 
integral of f on [a,b] and the upper integral of f on [a, b], respectively, and 
will be denoted by 
b b 
L famervove| Mi! ihe 


a a 


The lower and upper integrals both have several properties which the integral 
possesses. In particular, if a <c < b, then 


uf auf refs and ufseufssufs 


and if m < f(x) < M for all x m [a,b], then 


b b 
mb- ays f fo U f <M(b—-a). 


a 
The proofs of these facts are left as an exercise, since they are quite similar to the 
corresponding proofs for integrals. ‘The results for mtegrals are actually a corollary 


of the results for upper and lower integrals, because f is integrable precisely when 


Lf s=u "ef 


a 


We will prove that a continuous function f is integrable by showing that this 
equality always holds for continuous functions. It is actually easier to show that 


ee 


for all x in [a,b]; the trick is to note that most of the proof of Theorem 1 didn’t 
even depend on the fact that f was integrable! 


THEOREM 13-3 


If f is continuous on [a,b], then f is integrable on [a, b]. 


PROOF — Define functions L and U on [a,b] by 


L@) =uf Dace Uy) = uf je 
et x be ity (a, b). Sieh >*) and 


m= inf{f/@)o* << « LAW 
Mi; =sup| f(r < t < 
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then 
xt+h x+h 
m,:-h<L fsuf f <M,-h, 
No) | | 
myp-h < L(x +h) — L(x) < U(x +h) — U(x) < M, +h 
or 


L(x +h) — L(x) : U(x +h) — U(x) J 


M,,. 
h h é 


mys 


If h <0 and 


m, = inf{f(t):x +h <t <x}, 
M, =sup{f(t):x +h <t <x}, 


one obtains the same inequality, precisely as in the proof of Theorem 1. 
Since f is continuous at x, we have 


lim m, = lim M;, = f(x), 
10 h-0 
and this proves that 
LG U'te):= fe) for in (a,b). 
This means that there is a number c such that 
U(x) = L(x)+c for all x in [a, db]. 
Since 
U(a)= L(a) =9, 
the number ¢ must equal 0, so 
U(x) = L(x) for all x in fa, b]. 


In particular, 


b b 


a 


and this means that f is integrable on [a,b]. J 


PROBLEMS 


1. Find the derivatives of each of the following functions. 


3 


(1) Fix) = | sin’ t dt. 


a 


fe sin’ rdt) l 
(nu) F(x) =i ntl 
3 


1+sin°t + #2 


x P y ] 
(111) F(x) = / (/ ay tt] dy. 
5 \Js 1+f4-+sin*t 


l 
(iv) F(x) = i TT dt. 
« L1_+t-+sin‘t 
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b , 
x 
a |+t++sin‘?r 


(vi) F(x) =sin ([ sin (/ sin? f ar) ay). 
0 JO 


1 
(Vil) F~' where F(x) = / ae a 
ee (Find (F~')(x) in terms of 


l —l¢y), 
de Br" Gey) 


> 
p.008 = 
(vin) F-°, where F(x) =); 
0 V1l-? 
For each of the following f, if F(x) = fy f, at which points x is F(x) = 
f(x)? (Caution: it might happen that F’(x) = f(x), even if f is not contin- 
uous at X.) 


i) A O=O easy Fx) = bt x= |. 
(un) fitp=Ostaeaals fRyHiil tx 1. 
(ii)! C—O nae te Fe) bait x= 1, 


iv) f(x) =O if x is irrational, f(x) = 1/g if x = p/q in lowest terms. 
WW SC =—O0 s= 0 fix) If x>:0: 

vi) AG) =Oar os Olorx = 1 f(x) = 1/[(l/x] F O< 2s a 

vil) f is the function shown in Figure 6. 


(viii) f(x) = lif x = 1/n for some n in N, f(x) = 0 otherwise. 


ee ES ee 


al 

>|— 

Ni=— 
_ 


FIGURE 6 


Show that the values of the following expressions do not depend on x: 


‘ | L/x l 
(1) / ——~ dt + i SS dt. 
Jo. bebe Jo L+t¢ 


sin x | 


(11) . —t—. xX im (O, 2/2). 
— COSK y/ ] — [- 
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*10. 


Lt. 


Find (f~')'(O) if 
GQ) fit) = ik 1 + sin(sin fr) dt. 
0 


x 
(i) =f (x) = cos(cos ft) dt. 
I 
(Don’t try to evaluate f explicitly.) 


Find a function g such that 


(i) [ tsmarerte. 
0 


x? = 
i) | te(t)dt =x +x. 
0 
(Notice that g is not assumed continuous at 0.) 


(a) Find all continuous functions f satisfying 
x “ 
/ F =) -E for some constant C 4 0 
0 


assuming that f has at most one 0. 

(b) Also find a solution that is 0 on an interval (—oo, b| with 0 < b, but 
non-zero for x > b. 

(c) Finally, for C = 0 and any interval [a, b] with a < 0 < J, find a solution 
that is 0 on [a,b], but non-zero elsewhere. 


Use Problem 13-23 to prove that 


Find F’(x) if F(x) = fj xf(t)dt. (The answer is not xf(x); you should 
perform an obvious manipulation on the integral before trying to find F’.) 


Prove that if f is continuous, then 


[ f(u)(x —u)du= i! (/ firydr) du. 
0 0 0 


Hint: Differentiate both sides, making use of Problem 8. 


Use Problem 9 to prove that 


x x “uo My 
/ f(uy(x — u)? du = 2 f (| (/ fdr) du; dup. 
0 0 0 0 


. ; , : <h oa eoel(s ; 
Find a function f such that f(x) = 1 yi V1l+sin°x. (This problem is 


supposed to be easy; don’t misinterpret the word “find.”) 


FIGURE 7 


P2. 


13. 


*14, 


*19; 


16. 


“17. 


18. 


19. 
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A function f is periodic, with period a, if f(x +a) = f(x) for all x. 


(a) If f is periodic with period a and integrable on [0, a], show that 


a b+a 
i = / f tor all b. 
0) b 


(b) Find a function f such that f is not periodic, but f’ is. Hint: Choose 
a periodic g for which it can be guaranteed that f(x) = Ih g is not 
periodic. 

(c) If f’ is periodic with period a and f(a) = f(O), then f is also periodic 
with period a. 


*(d) Conversely, if f’ is periodic with period a and f is periodic (with some 


period not necessarily = a), then f(a) = f(O). 


Find He /x dx, by simply guessing a function f with f’(x) = x, and using 
the Second Fundamental Theorem of Calculus. Then check with Prob- 
lem 13-21. 


Use the Fundamental Theorem of Calculus and Problem 13-21 to derive the 

result stated in Problem 12-21. 

Let C), C and C2 be curves passing through the origin, as shown in Figure 7, 

Each point on C can be joined to a point of C; with a vertical line segment 

and to a point of C2 with a horizontal line segment. We will say that C bisects 

C; and C2 if the regions A and B have equal areas for every point on C. 

(a) If Cis theieraplvot 7 (0) = x?,.x > Oand C is the sraph or f(a) = 2x, 
x > O, find C2 so that C bisects C) and Cp. 

(b) More generally, find C2 if C; is the graph of f(x) = x”, and C is the 
Ctapi Ole; (i — ex fonsome Gg. -al, 

(a) Find the derivatives of F(x) = fy 1/t dt and G(x) = bis lt dt. 

(b) Now give a new proof for Problem 13-15. 


Use the Fundamental Theorem of Calculus and Darboux’s Theorem (Prob- 
lem 11-60) to give another proof of the Intermediate Value Theorem. 


Prove that if f is continuous, f and g are differentiable, and 


(x) 
then F’(x) = h(g(x)) - 9’(x) — A(f(x)) - f’(x). Hint: Try to reduce this to 


the two cases you can already handle, with a constant either as the lower or 


g(x) 
BX l= / h(t) dt, 
if 


the upper limit of integration. 


Let f be integrable on [a, b], let c be in (a, b), and let 
Fx) =f St, @= % <b. 


For each of the following statements, give either a proof or a counterexample. 


(a) If f is differentiable at c, then F is differentiable at c. 
(b) If f is differentiable at c, then F’ is continuous at c. 
(c) If f’ is continuous at c, then F’ is continuous at c. 


300 = Derwwatives and Integrals 


#20. 


21. 


224 


sis » 


Let 
| 

COS =, te") 
x 

0: x =0. 


fa)= 


Is the function F(x) = fy f differentiable at 0? Hint: Stare at page 179. 


Suppose that f’ is integrable on [0,1] and f(0) = 0. Prove that for all x in 


{0, 1] we have 
| 
Toe if Tae 


Show also that the hypothesis f(0) = 0 is needed. Hint: Problem 13-39. 


Suppose that f is a differentiable function with f(0) = 0 and 0 < f’ < 1. 
Prove that for all x > 0 we have 


fins? <( fiz) 


(a) Suppose G’ = g and F’ = f. Prove that if the function y satisfies the 
differential equation 


(*) g(y(x)) + yc) = f(x) — for all x in some interval, 
then there ts a number c such that 
(2) G(y(x)) = F(x) +c for all x in this interval. 


(b) Show, conversely, that if y satisfies (*), then y is a solution of (x). 


(c) Find what condition y must satisfy if 


ee 


y (x) = 14 yx) 


(In this case g(t) = 1+¢t and f(t)=1+ t7.) Then “solve” the resulting 
equations to find all possible solutions y (no solution will have R as its 
domain). 


(d) Find what condition y must satisfy if 


—] 
775pc0)" 


f 


y @&) 


(An appeal to Problem 12-14 will show that there ae functions satisfying 
the resulting equation.) 


(ec) Find all functions y satisfying 
y(x)y'(x) = —x 


ind the solution y satisfying y(O) = —1. 


Vass 


(asi 


26. 


a7. 
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In Problem 10-19 we found that the Schwarzian derivative 
7 7 5 ey 
Tiles) ee 8) 


was 0 for f(x) = (ax + b)/(cx +d). Now suppose that f is any function 
whose Schwarzian derivative is 0. 


(a) f”?/f' is a constant function. 
(b) f is the form f(x) = (ax + b)/(cx +d). Hint: Consider wu = f’ and 


apply the previous problem. 


The limit Jim fa f, if it exists, is denoted by f~ f (or f* f(x) dx), and 


called an “improper integral.” 


(a) Determine ike © act ree |, 

(b) Use Problem 13-15 to show that f;* 1/x dx does not exist. Hint: What 
can you say about [ i 1/x dx? 

(c) Suppose that f(x) > O for x > O and that a f exists. Prove that if 
0 < g(x) < f(x) for all x > 0, and g is integrable on cach interval 
[0, N], then fj~ g also exists. 


(d) Explain why fy~ 1/(. + x*) dx exists. Hint: Split this integral up at 1. 


Decide whether or not the following improper integrals exist. 


x Y 
1 ————— (Ix. 
(i) / ae 


Xx 
il ———~ dx. 
(11) i [are ax 


(11 x (this is really a type considered in Problem 28). 


i ] 
) | =a 
0 xVvl+x 
rn . . a . . . ach SA a : a 
he improper integral f°. f is defined in the obvious way, as yim i ircm ke 
. « F rae (o's) a KB yi) Sa" : ie raeree 
But another kind of improper integral {~\ f is defined in a nonobvious way: 


2. 0 : ; 
it 1s the f+f ~ Jf, provided these improper integrals both exist. 


(a) Explam why f= 1/(1 +x?) dx exists. 


cone 8) 
: Ai OO 6 : . N 
(b) Explain why [°> x dx does not exist. (But notice that lim f~) x dx does 
ioe Nooo WN 
exist.) 
. ale, 6) ° . N ~ ee e P ROO 
(c) Prove that if f—.. f exists, then Jim erm arose inal eens Nr 
: N+] ; N . 
Show morcover, that lm [ f and tm A y2 f both exist and equal 
N>oo* N-ow* : 
Oo 1 . . x E n 
J-\ f. Gan you state a reasonable generalization of these facts? (If you 
can't, you will have a miserable time trying to do these special cases!) 
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28. 


30. 


There is another kind of “improper integral” in which the imterval is 
bounded, but the function is unbounded: 


(a) 


If a > O, find hm f'1//x dx. This limit is denoted by fj 1//x dx, 


even though the function f(x) = 1//x is not bounded on [0,a], no 
matter how we define f(Q). 

Find ip x avi) < p< 0, 

Use Problem 13-15 to show that 16 x7! dx does not make sense, even as 
a limit. 

Invent a reasonable definition of ie |x|" dx for a < O and compute it for 
—-l<r<Q0O. 

Invent a reasonable definition of Ie Ge dasa iowa 


ee ee : : “ a0) = 
limits, and show that the limits exist. Hint: Why does (es (lx) ax 
exist? How does (1 +.x)7!/2 compare with (1 — x2)7!/2 for —l < x < 0? 


| 
IO dt. 


If f is continuous on [0, 1], compute lim =| 
x—Ot 


as 


If f is mtegrable on [0, 1] and continuous at 0, compute 


l a 
: 1) 
lim x u ——dt. 
SO le FG 


It is possible, finally, to combine the two possible extensions of the notion of 
the integral. 


(a) 


(b) 


If fic =e ye fon 0 2a < "1 and fG) = i tor. = lefnd 
i f (x) dx (after deciding what this should mean). 
0 


OO 
Show that x’ dx never makes sense. (Distinguish the cases —] < 
0 
r <Oandr < —1. In one case things go wrong at 0, in the other case 
at 00; for r = —1 things go wrong at both places.) 
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CHAPTER 


ly 


hy 


THE TRIGONOMETRIC FUNCTIONS 


The definitions of the functions sin and cos are considerably more subtle than 
one might suspect. For this reason, this chapter begins with some informal and 
intuitive definitions, which should not be scrutinized too carefully, as they shall 
soon be replaced by the formal definitions which we really intend to use. 

In elementary geometry an angle is simply the union of two half-lines with a 
common initial point (Figure 1). 


————— oa 


ee 


FIGURE 1 


More useful for trigonometry are “directed angles,” which may be regarded as 
pairs (/;,/2) of half-lines with the same initial point, visualized as in Figure 2. 


ly 
ly 
hy 
I 
ly 
Ip 


RIG URE 2 


If for 1; we always choose the positive half of the horizontal axis, a directed angle 
is described completely by the second half-line (Figure 3). 

Since each half-line intersects the unit circle precisely once, a directed angle is 
described, even more simply, by a point on the unit circle (Figure 4), that is, by a 
point (x, y) with x7 + y* = 1. 
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FIGURE 4 


FIGURE 5 


FIGURE 6 


360 


The sine and cosine ofa directed angle can now be defined as follows (Figure 5): 
a directed angle is determined by a point (x, y) with x? + y* = 1; the sine of the 
angle is defined as y, and the cosine as x. 

Despite the aura of precision surrounding the previous paragraph, we are not 
yet finished with the definitions of sin and cos. Indeed, we have barely begun. 
What we have defined is the sine and cosine of a directed angle; what we want 
to define is sm.x and cosx for each number x. ‘The usual procedure for doing 
this depends on associating an angle to every number. ‘The oldest method is to 
“measure angles in degrees.” An angle “all the way around” is associated to 360, 
an angle “half-way around” is associated to 180, an angle “a quarter way around” 
to 90, etc. (Figure 6). The angle associated, in this manner, to the number x, is 
called “the angle of x degrees.” The angle of 0 degrees is the same as the angle 
of 360 degrees, and this ambiguity is purposely extended further, so that an angle 
of 90 degrees is also an angle of 360 + 90 degrees, etc. One can now define a 
function, which we will denote by sm_, as follows: 


sin’ (x) = sine of the angle of x degrees. 


There are two difficulties with this approach. Although it may be clear what we 
mean by an angle of 90 or 45 degrees, it is not quite clear what an angle of V2 
degrees is, for example. Even if this difficulty could be circumvented, it is unlikely 
that this system, depending as it does on the arbitrary choice of 360, will lead 
to elegant results— it would be sheer luck if the function sm” had mathematically 
pleasing properties. 

“Radian measure” appears to offer a remedy for both these defects. Given any 
number x, choose a pomt P on the unit circle such that x is the length of the 
arc of the circle beginning at (1, 0) and running counterclockwise to P (Figure 7). 
The directed angle determined by P 1s called “the angle of x radians.” Since the 
length of the whole circle is 27, the angle of x radians and the angle of 27 + x 
radians are identical. A function sin’ can now be defined as follows: 


sin’ (x) = sine of the angle of x radians. 


This same method can easily be adopted to define sm ; since we want to have 
sin’ 360 = sin’ 27, we can define 


= Mee 
i e— Sill SSN) Sona: 
360 180 
We shall soon drop the superscript r in sin’, smmce sin’ (and not sin’) is the only 


function which will terest us; before we do, a few words of warning are advisable. 
‘The expressions sm x and sin’ x are sometimes written 
Sie 
sin.x radians, 


but this notation is quite misleading; a number x is simply a number —it does not 
carry a banner mdicating that it is “in degrees” or “im radians.” If the meaning 


FIGURE 7 
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FIGURE 9 


length x 
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length 


area 
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of the notation “sm x” is in doubt one usually asks: 
“Is x in degrees or radians?” 
but what one means Is: 
“Do you mean ‘sin”’ or ‘sin”’?” 


Even for mathematicians, addicted to precision, these remarks might be dispens- 
able, were it not for the fact that failure to take them mto account will lead to 
incorrect answers to certain problems (an example is given in Problem 19). 

Although the function sin’ is the function which we wish to denote simply by sin 
(and use exclusively henceforth), there is a difficulty involved even m the definition 
of sin’. Our proposed definition depends on the concept of the length ofa curve. 
Although the length of a curve has been defined in several problems, it 1s also easy 
to reformulate the definition m terms of areas. (A treatment in terms of length 1s 
outhned in Problem 28.) 


Suppose that x is the length of the are of the unit circle from (1, 0) to P; this are 
thus contains x/2z of the total length 27 of the circumference of the unit circle. 
Let S denote the “sector” shown in Figure 8; S is bounded by the unit circle, the 
horizontal axis, and the half-Hne through (0,0) and P. The area of S should be 
x /2z times the area inside the unit circle, which we expect to be z; thus S$ should 
have area 

x BG 

ome 
We can therefore define cosx and sinx as the coordinates of the point P which 
determines a sector of area x/2. 

With these remarks as background, the rigorous definition of the functions sin 
and cos now begins. The first definition identifies 2 as the area of the unit circle 
more precisely, as twice the area of a semicircle (Figure 9). 


(This definition is not offered simply as an embellishment; to define the trig- 


onometric functions it will be necessary to first define sinx and cos.x only for 
Ones SRS hig 

The second definition is meant to describe, for —1 < x < 1, the area A(x) of 
the sector bounded by the unit circle, the horizontal axis, and the half-lme through 
(xy Ey i Ot << ihis arenrcan be expressed (Figure 10) as the sum of 


the area of a triangle and the area of a region under the unit circle: 


Dy 
+f Vl —?dt. 
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FIGURE 10 


BUG IWIRSES Tal 


DEFINITION 


SG Ra a2; 


DEFINITION 


‘his same formula happens to work for —1 <x <0 also. In this case (Figure 11), 


the term 
aa ox - 


» 


“~ 


is negative, and represents the area of the triangle which must be subtracted from 


the term 1 
if Vv lk—e* ai. 
x 


If —1 <x < 1,then 


Notice that if —1 <x < 1, then A is differentiable at x and (using the Funda- 
mental ‘Vheorem of Calculus), 


—2. 
Arta) pee Sh 1 ere ee ee 


Notice also (Figure 12) that on the interval [—1.1] the function A decveases 


I 
1 
Ai) =0+ f Valea 5 
—| 
to A(1) = 0. This follows directly from the definition of A, and also from the fact 
that its derivative is negative on (—1], 1). 
For 0 < x < @ we wish to define cosx and sinx as the coordinates of a point 


from 


P = (cosx,sin.x) on the unit circle which determines a sector whose area is x /2 
(figure 13). In other words: 


If 0 <x <a, then cos x is the unique number in [—1, 1] such that 
x 


7 (GOSe)—= a 


sin x = J 1 = (cos.x)2. 


THEOREM 1 


PROOF 


P = (cosx,sinx) 


x 
area 5 


FIGURE 13 
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This definition actually requires a few words of justification. In order to know 
that there 7s a number y satisfying A(y) = x/2, we use the fact that A is continuous, 
and that A takes on the values 0 and 2/2. This tacit appeal to the Intermediate 
Value Theorem is crucial, if we want to make our preliminary definition precise. 
Having made, and justified, our definition, we can now proceed quite rapidly. 


If 0 < x < 2, then 
cos’ (x) = —sinx, 
sin’ (x) = cos x. 
If B = 2A, then the definition A(cos.x) = x/2 can be written 
B(cosx) = Xx; 


in other words, cos is just the inverse of B. We have already computed that 


| 
Ate) = —{j——_—$___—. 
2V1—x? 
from which we conclude that 
; l 
B(x) = = —=_=. 
Vv1—x? 


Consequently, 
cos'(x) = (B-!)'(x) 
7 | 
PBB) 
| 
- l 
V1 =f )I- 
= — V1 —(cosx)? 


= —sinx. 


sinx = / 1 — (cos x)?, 


Since 


we also obtain 


—2cosx - cos’(x) 


/1— (cosx)? 


cos X sin x 


sin’ (x) = 


Nl — 


sin Xx 
=cosx. J 


The information contained in ‘Theorem | can be used to sketch the graphs of 
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sin and cos on the interval [0,2]. Since 
cos (x) =—sinx <0, O<x<n,, 


the function cos decreases from cos 0 = | to cosa = —1 (Figure 14). Consequently, 
cos y = 0 for a unique y in [0, 2]. ‘To find y, we note that the definition of cos, 


osx) = =. 
FIGURE 14 A(cosx) 5 


means that 


y 
A(Q) = 5 


y= v1—? dt. 
0 


SO 


It is easy to see that 


0 | 
V1 =Par= [ Vl—fdt 
FIGURE 15 -1 0 
so we can also write 


| Vi-Pat ==. 
=| 2 


Now we have 
>0; 0<2x-<7/2 


sin (x) = cos x 
<0)" Rf SX =H; 


so sin increases on [Q, 7/2] from sin0 = 0 to sina /2 = 1, and then decreases on 
[7 /2, x] to sna = 0 (Figure 15). 
The values of sinx and cosx for x not in [0,2] are most easily defined by a 
two-step piecing together process: 
(1) Ifm <x < 27, then 


sinx = —sin(27 — x), 
cosx = cos(27 — x). 


Figure 16 shows the graphs of sin and cos on [0, 27]. 


(2) If x = 27k +x’ for some integer k, and some x’ in [0, 277], then 


sinx = sin’, 


/ 
. 


COS X = COS X 


Figure 17 shows the graphs of sin and cos, now defined on all of R. 
(b) Having extended the functions sin and cos to R, we must now check that the 
FIGURE 16 basic properties of these functions continue to hold. In most cases this is easy. For 
example, it is clear that the equation 


z 2 9 
sin~ x--+ cos’ x = | 
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FIGURE 17 
holds for all x. Itis also not hard to prove that 


sin’ (x) = cos x, 
cos (x) = —sinx, 


if x is not a multiple of 2. For example, if 7 < x < 27, then 


sin x = —sin(27 — x), 
SO 
sin (x) = —sin’(2m7 — x)-(—1) 
= cos(27 — x) 
= COS X. 


If x is a multiple of 2 we resort to a trick; it is only necessary to apply Theo- 
rem 11-7 to conclude that the same formulas are true in this case also. 


ee ee 


1 { | | 
i il. | | 
j i\6 ot ' | 
| | | | 
\ | | | 
| | | | 
i T = a —f 
2 | aN 2n 
1 | NaI | 
i | 2 | | 
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THEOREM 2 
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PROOF 
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arcsm 


FIGURE 20 


The other standard trigonometric functions present no difficulty at all. We 


define 
] 
Se — 
COE ie = lot wat 
sin x 
tanx = 
cos.x 
] 
cco, = — 
sIn.x ’ ’ 
Be Sa Sr. 
COs: i 
cotx = — 
sin.x 


The graphs are sketched in Figure 18. It is a good idea to convince yourself that 
the general features of these graphs can be predicted from the derivatives of these 
functions, which are listed in the next theorem (there is no need to memorize the 
statement of the theorem, since the results can be rederived whenever needed.) 


[ih rer oe 2. ule 


sec’/(x) = sec x tan x, 
9 
tan’(x) = sec’ x. 


ti x = ke. them 


esc (x) = — csc x cot x, 


sD 
eq (ie) —"=acse Xx. 
Left to you (a straightforward computation). J 


The inverses of the trigonometric functions are also easily differentiated. ‘Vhe 
trigonometric functions are not one-one, so it Is first necessary to restrict them 
to suitable intervals; the largest possible length obtainable is 2, and the intervals 
usually chosen are (Figure 19) 


[—m/2,2/2] for sin, 
[0, x] for cos, 
(—2/2,1/2) for tan. 


(The inverses of the other trigonometric functions are so rarely used that they will 
not even be discussed here.) 
The inverse of the function 
tein x, =| Daa Ne ee 

is denoted by aresin (Figure 20); the domain of arcsin is [—1, 1]. Phe notation 
pons | : ee : : : : 

sur has been avoided because arcsin is not the inverse of sin (which Is not one- 
one), but of the restricted function f; sometimes this function f is denoted by Sin, 


: || 
aud ares by Sin. 


PIGURE 21 


FIGURE 23 THEOREM 3 
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The inverse of the function 
o(x) =6eos x: OS em 7 


is denoted by arecos (Figure 21); the domain of arccos is [—1, |]. Sometimes g 
is denoted by Cos, and arccos by Cos !. 


‘The inverse of the function 
Hi @a= tail Xe —n/2<x <n/2 


is denoted by arctan (Figure 22); arctan is one of the simplest examples of a 
differentiable function which is bounded even though it is one-one on all of R. 
Sometimes the function h is denoted by Tan, and arctan by Tan™. 

The derivatives of the inverse trigonometric functions are surprisingly simple, 
and do not involve trigonometric functions at all. Finding the derivatives is a simple 
matter, but to express them in a suitable form we will have to simplify expressions 
like 


Gostamcsim < ). sec(arctan x). 


arctan 


FIGURE 22 


A little picture is the best way to remember the correct simplifications. For exam- 
ple, Figure 23 shows a directed angle whose sine is x- the angle shown is thus an 
angle of (arcsin x) radians; consequently cos(arcsin x) is the length of the other 


A D) 3 a 
side, namely, ¥ 1 — x-. However, in the proof of the next theorem we will not 
resort to such pictures. 


i= aoe = Te then 


l 


v1 Sn 


—] 


eae, 


arcsin (x) = 
arccos (x) = 
Moreover, for all x we have 


arctan’ (x) = ——. 
l+x- 
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PROOF 


arcsin’ (x) = (f~")'(x) 

| 
FS) 

| 
sin (aresin x) 

l 
cos(arcsin x) 
Now 

[sin(arcsin x)]* + [cos(aresin x)]? = 1, 


that 1s, 


2 : 2) 
x~ + |cos(aresin x)|* = I; 


therefore, 
. 9 
cos(arcsin x) = V1 — x. 


(The positive square root is to be taken because arcsin.x is in (—2/2, 2/2), so 
cos(arcsin x) > 0.) ‘This proves the first formula. 

The second formula has already been established (in the proof of Theorem 1). 
It is also possible to imitate the proof for the first formula, a valuable exercise if 
that proof presented any difficulties. The third formula is proved as follows. 

arctan’ (x) = (h-!)'(x) 
| 

h'(h-'(x)) 

| 
tan’(arctan x) 

| 
~ sec? (arctan x) 
Dividing both sides of the identity 

sin? a + cos?a = | 


? . 
by cos“ a yields 


9 y 
tan°-a+1=sec“a. 


It follows that 


; 2 ray, 
[tan(arctan x)|~ + | = sec*(arctan x), 


or 


, 6] 
x~ + 1 = sec*(arctan x), 


which proves the third formula. J 


The traditional proof of the formula sin’(x) = cos.x (quite different from the one 
given here) is outlined in Problem 27. This proof depends upon first establishing 


LEMMA 


PROOF 


THEOREM 4 
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the limit 
. sink 
lim = ||) 
h--0 h 


and the “addition formula” 
sin(x + y) = sin x cos y + cosx sin y. 


Both of these formulas can be derived easily now that the derivative of sin and cos 
are known. The first is just the special case sin'(0) = cos0. The second depends 
on a beautiful characterization of the functions sin and cos. In order to derive this 
result we need a lemma whose proof involves a clever trick; a more straightforward 
proof will be supplied in Part TV. 


Suppose f has a second derivative ever vhere and that 
I 


f+ f =O, 
ya(Q) =a, 
fae 0) 


whens =U. 


Multiplying both sides of the first equation by f’ yields 
ff" + ae = 0: 
‘Thus 
[p= fal = 207 Fe Ff) = 0, 


so (f’)* + f? is a constant function. From f(O) =0 and f'(0O) = 0 it follows that 
the constant is 0; thus 


Lf)? + [Ff]? =0 for all x. 


This implies that 


(GG) = 0° forall. ff 


If f has a second derivative everywhere and 


fad i - =a 
(OES fy 
ja) iy 


then 


f =b-sin+a-cos. 


(In particular, if f(0) = 0 and f’(O) = 1, then f =sm;if f(0) = 1 and f’(0) = 0, 


then f = cos.) 
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PROOF Let 
g(x) = f(x) —bsinx —acosx. 
Then 
g(x) = f(x) — bcosx +asinx, 
eg’ (x) = f"(x) + bsinx + acosx. 


Consequently, 


g’ +g =0, 
g(0O) = 0, 
2'(O) = 0, 


which shows that 


0 = g(x) = f(x) — bsinx —acosx, for all x. J 


THEOREM 5 If x and y are any two numbers, then 
sin(x + y) =sinx cos y + cosx sin y, 
cos(x + y) = cos x cos y — sin x sin y. 
PROOF For any particular number y we can define a function f by 
f(x) =sin(x + y). 
Then 
f(x) = cos(x + y) 
f(x) = —sin(x + y). 
Consequently, 


f" + f =9, 
f(O) = sin y, 
f' (0) = cos y. 


It follows from Theorem 4 that 
f = (cos y)+ sin +(sin y) - cos: 


that is, 
sin(x + y) = cos y sin.x + sin y cos x, for all x. 
Since any number y could have been chosen to begin with, this proves the first 
formula for all x and y. 
The second formula is proved similarly. § 


As a conclusion to this chapter, and as a prelude to Chapter 18, we will mention 
an alternative approach to the definition of the function sin. Since 


arcsin (x) = ————.. for —l=x<=lL, 


V1 x 
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it follows from the Second Fundamental Theorem of Calculus that 


Xx ] 
arcsin x = arcsin x — arcsin 0 = i ———— dt. 
0 Vl-?r 
This equation could have been taken as the definition of arcsin. It would follow 
immediately that 


| 
Vv | = 


the function sin could then be defined as (arcsin)~! and the formula for the deriva- 
tive of an inverse function would show that 


a} 5 
sin (x) =v | — sins x, 


which could be defined as cos.x. Eventually, one could show that A(cos.x) = x/2, 
recovering at the very end of the development the definition with which we started. 
While much of this presentation would proceed more rapidly, the definition would 
be utterly unmotivated; the reasonableness of the definitions would be known to 
the author, but not to the student, for whom it was intended! Nevertheless, as 


arcsin (x) = 


we shall see in Chapter 18, an approach of this sort is sometimes very reasonable 
indeed. 


PROBLEMS 
1. Differentiate each of the following functions. 


a) f(x) = arctan(arctan(arctan x)). 
(i) “f@)= aresin(arctan(arceos x )). 
Gn) f(x) = arctan(tan x arctan x). 


l 
a bee 


2. Find the followimg limits by Hopital’s Rule. 


Gv) f(x) =arcsin 


Bigs = oh eS 


o lim 

( ) e—10) x3 

iy i sink (6 

il ——————— 
10) xO 


welll Bice 
Gi) = lim ————_—-. 


x0) x2 
a Gosx — 1+ 2/2 
Cs ee 
x0 AG 
Do ekecrinbe = 6} 
CO). hn 
x0 x 


Pa ( I | ) 
wi) lm|-—-—- 
x-0\x 9 sinx 
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Let (a) = |r 


(a) 
(b) 


I. eo") 


Find f’(0). 
Find f’(0). 


At this point, you will almost certainly have to use /H6pital’s Rule, but in 
Chapter 24 we will be able to find f ‘)(Q) for all k, with almost no work 
at all. 


Graph the following functions. 


(g) 


iO =sir2x, 
Fxy = sin(x2). (A pretty respectable sketch of this graph can be ob- 
tained using only a picture of the graph of sin. Indeed, pure thought 
is your only hope in this problem, because determining the sign of the 
derivative f’(x) = cos(x2)-2x is no easier than determining the behavior 
of f directly. The formula for f’(x) does indicate one important fact, 
however— f’(0) = 0, which must be true since f is even, and which 
should be clear in your graph.) 
f(x) = smx + sin 2x. (It will probably be instructive to first draw the 
graphs of g(x) = smx and h(x) = sin 2x carefully on the same set of 
axes, from 0 to 27, and guess what the sum will look lke. You can 
easily find out how many critical pomts f has on [0, 27] by considering 
the derivative of f. You can then determine the nature of these critical 
points by finding out the sign of f at each point; your sketch will probably 
suggest the answer.) 
f(x) = tan x — x. (First determine the behavior of f in (—2/2, 7/2); m 
the intervals (ka — 1/2, ka + 2/2) the graph of f will look exactly the 
same, except moved up a certain amount. Why?) 
f(x) = sinx — x. (The material in the Appendix to Chapter 11 will be 
particularly helpful for this function.) 

sin Xx 
ee 

1, a). 
(Part (d) should enable you to determine approximately where the zeros 
of f’ are located. Notice that f is even and continuous at 0; also consider 
the size of f for large x.) 
i) = sib x. 


The hyperbolic spiral is the graph of the function f{(6) = a/@ in polar coordi 
nates (Chapter 4, Appendix 3). Sketch this curve, paying particular attention 


to its behavior for @ close to 0. 


Prove the addition formula for cos. 


ie; 


10. 


1 


(a) 


(b) 


(b) 
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From the addition formula for sin and cos derive formulas for sin 2x, 
cos 2x, sin 3x, and cos 3x. 

Use these formulas to find the following values of the trigonometric func- 
tions (usually deduced by geometric arguments in elementary trigonom- 


etry): 
Ft x 2 
sin — = cos — = —, 
4 2 
t a: | 
an — = 1, 
4 
ene ] 
sin a. 
xn WS 
0s — = —. 
5 = 


Show that A sin(x + B) can be written as a sin.x + bcos.x for suitable a 
and b. (One of the theorems in this chapter provides a one-line proof. 
You should also be able to figure out what a and bare.) 

Conversely, given a and b, find numbers A and B such that asin.x + 
bcosx = Asin(x + B) for all x. 

Use part (b) to graph f(x) = J3sin x + cos x. 


Prove that 
tan.x + tan y 
tan(x + y) = ——————— 
] — tan x tan y 
provided that x, y, and x + y are not of the form kz + 2/2. (Use the 
addition formulas for sin and cos.) 


Prove that 


x+y 
arctan x + arctan y = arctan {| ——— }. 
== XY 


indicating any necessary restricuons on x and y. Hint: Replace x by 
arctan x and y by arctan y in part (a). 


Prove that 


arcsin w + arcsin B = arcsin(av | — B° + Bv1l— a? ), 


indicating any restrictions on @ and fp. 


Prove that if m and n are any numbers, then 


sin mx sinax = 5[cos(m — n)x — cos(m +n)x}, 


Ni=— 


sin mx cosnx = 4{sin(m + n)x +sin(m —n)x}, 


COS INX COSHX = 


Ni— bol 


[cos(m + n)x + cos(m — n)x]. 
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12. Prove that if m and n are natural numbers, then 


oo 0, mAn 
sin mx sinnx dx = 


TT mn, 


y 0, msn 
cosmx cosnx dx = 


Tr mM n, 


y1 & 
/ sinmx cosnx dx = 0. 


1T 
‘These relations are particularly important in the theory of Fourier series. Al- 
though this topic will receive serious attention only in the Suggested Reading 
(see reference |26]), the next problem provides a hint as to their importance. 


13. (a) If f is mtegrable on [—z, 2], show that the minimum value of 


wT 
i) (f(x) —acosnx)* dx 


Fi 


occurs when 


| WT 
a=— f(x) cosnx dx, 
IT —T 
and the minimum value of 


/ (f(x)—a sin nx) dx 


Ps d 


when 


Poe . 
a=— f(x) sinnx dx. 
WH J—nx 


(In each case, bring a outside the integral sign, obtaining a quadratic 
expression in @.) 


(b) Define 


| T 
G, = -| ii{x)cosnxdx, n=O 2 soc 


Teja 
] Pi 

b, = -| WAR) SMU AX, N= Ws oo, ose 
UT J—n 


Show that if ¢; and d; are any numbers, then 


~ 


+ C N 
0 ‘ 
i f(x) -— = oo oy Cn, cosnx + d, sinnx dx 


—FU, 


” 


‘ x 2 N 
< 5) agcg CO : ; 
- il, Lf @ | dx —2n =; a a ye AnCn + bydn +1 ce + ye Cn” + ap 


n=! 


~ 


n=l n=l 


9 N 
1 te : 
= / [ f(x)|? dx —7X > + ae 4b 
——Tt ~_ 


n=! 


2 N 
(a4) ay 5) 9 
+ I & = wh + ) (C; —ay)> +r (d,, — Dy) ’ 


n=1 


14. 


1S. 


16. 


Lid 


18. 


he 


20. 


21. 


22, 
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thus showing that the first integral is smallest when a; = cj and bj = dj. 
In other words, among all “linear combinations” of the functions s,(«) = 
sinnx and c,(x) = cosax for 1 <n < N, the particular function 


N 
ag ; 
2% > + ) a, cosnx + b, smnx 


n=1 


has the “closest fit” to f on [—z, z]. 


(a) Find a formula for sin x + sin y. (Notice that this also gives a formula for 
sin.x —siny.) Hint: First find a formula for sin(a +b) + sin(a — b). What 
good does that do? 

(b) Also find a formula for cosx + cos y and cosx — cos y. 


' ; e : 2S ee. & 

a) Starting from the formula for cos 2x, derive formulas for sin x and cos? x 
oS 

in terms of cos 2x. 


(b) Prove that 


for_0 2c <7 72, 
(c) Use part (a) to find i sin? x dx and ie cos? x dx. 
(d) Graph f@) = sin? x 
Find sin(arctan x) and cos(arctan x) as expressions not involving trigono- 
metric functions. Hint: y = arctan x means that x = tan y = sin y/cosy = 


a 6 
sin y/v 1 — sin‘ y. 


If x = tanu/2, express sinu and cosu in terms of x. (Use Problem 16; the 
answers should be very simple expressions.) 


(a) Prove that sm(x + 27/2) = cosx. (All along we have been drawing the 
graphs of sin and cos as if this were the case.) 
(b) What is aresin(cos x) and arecos(sin x)? 


1 
(a) Find [a 
0 


(b) wh dt. 
0 — 


Find hm x sin — 
x00 wT 


mth Hint: The answer ts not 45. 


(a) Define functions sin” and cos° by sin’(x) = sin(zx/180) and cos’ (x) = 
cos(ax/180). Find (sin’)’ and (cos’)’ in terms of these same functions. 


- sins cael 
(b) Find lim and hin x sim —- 
x>0 Xx X00 x 


Prove that every point on the unit circle is of the form (cos 6, sin @) for at 
least one (and hence for infinitely many) numbers 6. 
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23. (a) Prove that mw is the maximum possible length of an imterval on 
which sin is one-one, and that such an interval must be of the form 
[2k —m/2,2ka +n/2] or [2kn +77/2, 2(k + 1) —2/2]. 
(b) Suppose we let g(x) = sinx for x in (2ka — 2/2, 2ka + 2/2). What is 
Cie 
24. Let f(x) = secx for 0 < x < a. Find the domain of f~! and sketch its 


graph. 


25. Prove that |sinx — sin y| < |x — y| for all numbers x 4 y. Hint: The same 
statement, with < replaced by <, is a very straightforward consequence of a 
well-known theorem; simple supplementary considerations then allow < to 
be improved to <. 


*26. It is an excellent test of mtuition to predict the value of 


b 
jim f f(x)sinaAx dx. 


Continuous functions should be most accessible to intuition, but once you 
get the right idea for a proof the limit can easily be established for any inte- 
grable f. 
(a) Show that lim f* sindAx dx = 0, by computing the integral explicitly. 
, i> OO 
(b) Show that if s is a step function on [a,b] (terminology from Prob- 
2 t : 
lem 13-26), then hm f. Ser) sin Ax dx = 0: 
A100 
= ; i ; 
(c) Finally, use Problem 13-26 to show that lim le 7 (x) sn Ax di = 0 for 
A—>co 


any function f which is integrable on [a, b]. This result, like Problem 12, 
plays an important role in the theory of Fourier series; it is known as the 
G Riemann-Lebesgue Lemma. 


27. ‘This problem outlines the classical approach to the trigonometric functions. 
The shaded sector in Figure 24 has area x/2. 


(a) By considering the triangles OAB and OCB prove that if 0 < x < 7/4, 


then : 
sin.x Bs smn.¥ 
B= (1,0) a ae 
2 we DCOS X 
FIGURE 24 (b) Conclude that 
sin. 
cosx < "2 
xX 
and prove that 
_ sins 
lim =|) 
x-0 XX 


(c) Use this luniit to find 


#25. 


9: 


*30. 


(d) 
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Using parts (b) and (c), and the addition formula for sin, find sin’(x), 
starting from the definition of the derivative. 


This problem gives a treatment of the trigonometric functions in terms of 


length, and uses Problem 13-25. Let f(~) = V1 —x? for -1 < x < 1. 
Define £(x) to be the length of f on [x, I]. 


(a) 


Show that 


a! 
L(x) =|) =———— dt. 
: oy PS 
(This is an improper integral, as defined in Problem 14-28, so you must 
first prove the corresponding assertion for the length on [x, | — e] and 
then prove that £(x) is the limit of these lengths as ¢ > 0*.) 
Show that 


l 


LQ) === for =| <x <5 


| ee 


Define 7 as £(—1). For 0 < x < a, define cosx by L(cosx) = x, and 
define sinx = V1 —cos?.x. Prove that cos’(x) = —sinx and sin’(x) = 


coss4or @ <2 <tr, 


Yet another development of the trigonometric functions was briefly men- 


tioned in the text starting with inverse functions defined by integrals. It 
is convenient to begin with arctan, since this function is defined for all x. 


To do this problem, pretend that you have never heard of the trigonometric 


functions. 


(a) 


Deu 0s) = rc +17)-! dt. Prove that w is odd and increasing, and that 


lim a(x) and hm a(x) both exist, and are negatives of each other. If 
3 cer 1G, B) bia AOS, 7 


we define 7 = 2 lim a(x), then a~! is defined on (—2/2, 2/2). 
AC — >.>.) 


Show that (@~!)/(x) = 1+ [on Gol 
for —2,/2,== 7/2 celine tan v= a~'(x), and then define sinx = 


tanx/v1+ tan? x. Show that 


(1) lia sin eal 
xn /2- 
(11) hm  snx =—! 
x—>—m/2+ 
sin.x 
ee ee ——, —m/2<x<a/2andx40 
(m) sin (x) = tan.x 
Ne oe) 
(iv) sin’(x) = —sinx for —2/2 <x < 72. 


If we are willing to assuine that certain differential equations have solutious, 


another approach to the trigonometric functions is possible. Suppose, in 
particular, that there is some function yo which is not always 0 and which 


satisfies yo” + vo = 0. 
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ol, 


#O2. 


(a) 
(b) 


Prove that yo? + (y0’)* is constant, and conclude that either yo(0) = 0) 
or yo (0) 4 0. 

Prove that there 1s a function s satisfying s” + s = 0 and s(O) = 0 and 
s‘(0) = 1. Hint: Try s of the form ayo + byo’. 


If we define sin = s and cos = s’, then almost all facts about trigono- 
metric functions become trivial. ‘There is one point which requires work, 
however— producing the number z. This is most casily done using an 
exercise from the Appendix to Chapter 11: 


Use Problem 6 of the Appendix to Chapter 11 to prove that cos x cannot 
be positive for all x > 0. It follows that there is a smallest x9 > O with 
cos x9 = 0, and we can define 7 = 2xp. 

Prove that.sinz/2-— jla(Since sin? + cos? = 1, we have sin 5 ee — Ses el 
the problem ts to decide why sin 2/2 ts positive.) 

Find cos 7, sinz, cos 27, and sin 27. (Naturally you may use any addi- 
tion formulas, since these can be derived once we know that sin’ = cos 
and cos’ = — sin.) 

Prove that cos and sin are periodic with period 27. 


After all the work involved in the definition of sin, it would be discon- 
certing to find that sm is actually a rational function. Prove that it isn’t. 
(There ts a simple property of sin which a rational function cannot pos- 
sibly have.) 

Prove that sin isn’t even defined imphcitly by an algebraic equation; that 
is, there do not exist rational functions fo, ..., ar sua that 


(Sine) =. f, SG) (om 2 oo Fo) =O for all x, 


Hint: Prove that fo = 0, so that sin x can be factored out. ‘Vhe remainmg 
factor is 0 except perhaps at multiples of 2. But this implies that it is 0 
for all x. (Why?) You are now set up for a proof by mduction. 


Suppose that ¢@; and ¢2 satisfy 


o: + io) =0. 
62" + g2¢2 = 0, 


and that g2 > g). 


(a) 


(b) 


Show that 


b1"b2 — o2"b — (22 — 2))b1¢2 = 0. 


Show that if @)(«) > 0 and 2(x) > O for all x mn (a,b), then 


b 
i 51! Game Onc Pen 


a 


and conclude that 


[py '(b)b2(b) — oy'(a \pr(a)] — [1 (b)b2'(b) a p\(a)2 (a)| =U: 


33. 


(c) 
(d) 
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Show that in this case we cannot have ¢1(a) = ¢1(b) = 0. Hint: Con- 
sider the sign of @1'(a) and ¢$)'(b). 

Show that the equations ¢; (a) = ¢1(b) = 0 are also impossible if ¢; > 0, 
g2 < 0 or dg) < 0, d2 > 0, or @ < 0, d2 < 0 on (a,b). (You should be 


able to do this with almost no extra work.) 


The net result of this problem may be stated as follows: if a and b are 
consecutive zeros of @1, then ¢2 must have a zero somewhere between 
a and b. This result, in a slightly more general form, is known as the 
Sturm Comparison Theorem. As a particular example, any solution of 
the differential equation 


ve ol (wi + l)y = @) 
must have at least one zero in any interval (#7, (a + 1)7). 


Using the formula for sin x — sin y derived in Problem 14, show that 


d es 
sin(k + 4)x —sin(k — 4)x = 2sin 5 COs oer 


~ 


Conclude that 


| sin(n + pe 
5 + Cosx + Cos 20=r COSn Y= =a Cai 
“ 2 sin = 

2 
Like two other results in this problem set, this equation is very important 
in the study of Fourier series, and we also make use of it in Problems 19-43 
and 23-22. 


Similarly, derive the formula 


sinx + sin 2x +---+sinax = 


(A more natural derivation of these formulas will be given m Prob- 


lem 27-14.) 


b b 
Use parts (b) and (c) to find i sinx dx and / cosx dx directly from 
0 0 


the definition of the integral. 


*CHAPTER 


Jt IS IRRATIONAL 


This short chapter, diverging from the main stream of the book, is included to 
demonstrate that we are already in a position to do some sophisticated mathemat- 
ics. This entire chapter is devoted to an elementary proof that z is irrational. Like 
many “elementary” proofs of deep theorems, the motivation for many steps in our 
proof cannot be supplied; nevertheless, it is still quite possible to follow the proof 
step-by-step. 

‘Two observations must be made betore the proof. The first concerns the func- 
tion 
eal =x)" 


Fnt®) = | 
n: 


which clearly satisfies 
: Lgies 
C= Eh) < = for 0 <x <1. 
n! 


An important property of the function /f,, is revealed by considering the expression 
obtained by actually multiplying out x”(1 —.x)". ‘The lowest power of x appearing 
will be n and the highest power will be 2n. Thus f, can be written in the form 


2n 


| 
EnlX) —= a ) Cit, 
n! 
i=" 
where the numbers ¢; are integers. It is clear from this expression that 


fi™ 0) =0 ifk <nork > 2n. 


Moreover, 


ir = —[n!cn + terms involving x] 
n! 


FPG) = —[(n + 1)!e,.) + terms involving x 
n! 


In©” (x) = —[2n)! ern]. 


324 


16. 1 ws Irrational 325 


‘This means that 


fir? (0) = Cn; 
fax" 0) = (n Je Yona 


fa?” (0) = (2n)(2n — 1) -...- (n+ Ler», 
where the numbers on the right are all integers. Thus 


fa'®(O) is an integer for all k. 
The relation 


Snr(X) = fr =) 


implies that 


fn (x) = (-1) fa 1 — x); 
therefore, 
fr (C1) is also an integer for all k. 


The proof that z is irrational requires one further observation: if a is any 
positive number, and ¢ > O, then for sufficiently large n we will have 


a’ 


— <6. 
n! 
To prove this, notice that if nm > 2a, then 


qnt! a qa" la 


(n+1)! n+] nl =o al 


Now let no be any natural number with no > 2a. Then, whatever value 


nO 


a 
(ng)! 
may have, the succeeding values satisfy 
qtor 1) | qa” 
—— -< —* 
(no +1)! 2 (no)! 
qot+2) | qtr) | 1 q"0 


(no + 2)! = 2 , (no + 1)! * 2 2 (no)! 


qitot’) q” 


l 
(ng +k)! e Qk (no)! 


ny 


If k is so large that < 2* then 


(no)! é 
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qitotk) 
which is the desired result. Having made these observations, we are ready for the 
one theorem in this chapter. 


rp . . . . ~ ys . . NT . . . . 

Che number is irrational; in fact, 2~ 1s irrational. (Notice that the irrationality 
. 9 NO . . . . + . “ . o] . 

of 2 implies the irrationality of 7, for if 2 were rational, then 2~ certainly would 

be.) 


. 9 ° 
Suppose ~~ were rational, so that 


for some positive integers a and b. Let 
(1) G@)'= b" | 27" fy (x) Sp £2 x) ae OOO) 

= see (=1)" fy (x). 
Notice that each of the factors 


hb" x2" 2k == b" (7)" k = b” (") = a" k pk 
2) 


is an integer. Since f,)(O) and f,,“)(1) are integers, this shows that 
G(O) and G(1) are integers. 

Differentiating G twice yields 

(2) Cxy= b"[27" fy" (x) = ade FAG Ge ee 3s a Shamim 3b 

The last term, (—1)" fy°2"*?)(x), is zero. Thus, adding (1) and (2) gives 

(3) G" (x) +2°G(x) = bint? £ (x) = ma" f(x). 


Now let 
H(x) = G(x) sinax — 7G(x)cos7x. 


‘Then 


H'(x) =2G'(x) cosax + G(x) sinax — 1G'(x)cosmx + 27G(x) sinax 
9) 


= [G"(x) + 2°G(x)] sinax 
= ma" f,(x) sin mx, by (3). 


By the Second Fundamental Theorem of Calculus, 


rf a" f,(x)sinax dx = H(1) — H(Q) 
0 


= G'(1)sinaz —2G(1) cosa — G'(0) sin0 + 2 G(0) cosO 
m|G(1) + G(O)]. 


Thus 


EO ee 
| a" f,(x)sinaxdx is an mteger. 
JO 
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On: the othenhand (Oe G,GO:) =a! tor Oleh, so 


Tar 


O = 7a" fax) sitx =< a fom@i— — ah. 
n! 


Consequently, 


n! 


] n 
ma 
QO< x | a" f,(x)sinax dx < ——. 
0 


‘This reasoning was completely independent of the value of n. Now if 1 is large 
enough, then 


1 n 
, ma 
Oro a” f,(x)sin wx dx < — < l. 
0 n! 
But this is absurd, because the integral is an integer, and there is no integer between 
= a ad 5 a 3) 8 . 5 
Q and |. ‘Thus our original assumption must have been incorrect: 2 is irrational. 
fe) 


This proof is admittedly mysterious; perhaps most mysterious of all is the way 
that wz enters the proof—it almost looks as if we have proved z irrational without 
ever mentioning a definition of a. A close reexamination of the proof will show 
that precisely one property of z 1s essential 


Sale) = Ob 


The proof really depends on the properties of the function sin, and proves the 
irrationality of the smallest positive number x with sinx = 0. In fact, very few 
properties of sin are required, namely, 


sin’ = Cos, 

cos = —sin, 
iO) — OP 
Gos (0) — ie 


Even this list could be shortened; as far as the proof is concerned, cos might just 
~ oe, 4 . ae . . . 

as well be defined as sin. ‘The properties of sin required in the proof may then be 

written 


sin” +sin = 0, 
Sin 0) — ©, 
sin (0) = |. 


Of course, this is not really very surprising at all, since, as we have seen in the 
previous chapter, these properties characterize the function sin completely. 


PROBLEMS 


1. (a) Forthe areas of triangles OAB and OAC in Figure 1, with ZAOB < 7/4, 
show that we have 
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FIGURE 1 


(b) 


(a) 


(b) 


1 — /1 — 16(area OAB)? 


2 


area OAC = 


N|— 


; . = 9 9 
Hint: Solve the equations xy = 2(area OAB), x- + y- = 1, for y. 
Let P,, be the regular polygon of m sides inscribed in the unit circle. If 
Am is the area of P,, show that 


A2m = SV 2-2 1 —12A»/m)?: 


This result allows one to obtain (more and more complicated) expressions 
for Ag, starting with Ay = 2, and thus to compute 7 as accurately 
as desired (according to Problem 8-11). Although better methods will 
appear in Chapter 20, a slight variant of this approach yields a very 
interesting expression for 7: 


Using the fact that 


area(OAB) _ 
area(OAC) az 


show that if @,, is the distance from O to one side of P,,, then 


Am a 
a mm 
A2m 
Show that 
2 
= W4-Ag-...* Mk-1 
re 
Using the fact that 
4 
Qn = COS —, 
m 
F l+cosx , 
and the formula cos x/2 = ,/ ——— (Problem 15-15), prove that 
= 
a= 
a16 = 


etc, 
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Together with part (b), this shows that 2/2 can be written as an “infinite 
product” 


to be precise, this equation means that the product of the first 1 factors 
can be made as close to 2/z as desired, by choosing n sufficiently large. 
‘This product was discovered by Frangois Viete in 1579, and is only one of 
many fascinating expressions for 7, some of which are mentioned later. 


i) 


FIGURE 1 


*ChAD ER 


PLANETARY MOTION 


Nature and Nature’s Laws lay hid in night 
God said “Let Newton be,” and all was light. 


Alexander Pope 


Unhke Chapter 16, a short chapter diverging from the main stream of the book, 
this long chapter diverges from the main stream of the book to demonstrate that 
we are already in a position to do some real physics. 

In 1609 Kepler published his first two laws of planetary motion. The first law 
describes the shape of planetary orbits: 


The planets move in ellipses, with the sun at one focus. 


The second law involves the area swept out by the segment from the sun to the 
planet (the ‘radius vector from the sun to the planet’) in various time intervals 


(Figure 1): 


Equal areas are swept out by the radius vector in equal times. (Equivalently, the area 
swept out in time t is proportional to t.) 


Kepler’s third law, published in 1619, relates the motions of different planets. If a 
is the major axis of a planet’s elliptical orbit and T is its period, the time it takes 
the planet to return to a given position, then: 


The ratio a3 /T? is the same for all planets. 


Newton’s great accomplishment was to show (using his general law that the 
force on a body is its mass times its acceleration) that Kepler’s laws follow from the 
assumption that the planets are attracted to the sun by a force (the gravitational 
force of the sun) always directed toward the sun, proportional to the mass of the 
planet, and satisfymg an inverse square law; that is, by a force directed toward 
the sun whose magnitude varies inversely with the square of the distance from the 
sun to the planet and directly with the mass of the planet. Simce force is mass 
times acceleration, this is equivalent simply to saying that the magnitude of the 
acceleration is a constant divided by the square of the distance from the sun. 

Newton’s analysis actually estabhshed three results that correlate with Kepler’s 
individual laws. The first of Newton’s results concerns Kepler’s second law (which 
was actually discovered first, nicely preserving the symmetry of the situation): 
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(b) 


FIGURE 2 
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hepler’s second law 1s true precisely for ‘central forces’, t.e., of and only the force between 
the sun and the planet always les along the line between the sun and the planet. 


Although Newton is revered as the discoverer of calculus, and mdeed invented 
calculus precisely in order to treat such problems, his derivation hardly seems to 
use calculus at all. Instead of considering a force that varies continuously as the 
planet moves, Newton first considers short equal time intervals and assumes that 
a momentary force is exerted at the ends of each of these intervals. 

‘To be specific, let us imagine that during the first time interval the planet moves 
along the line P; P2, with uniform velocity (Figure 2a). If, during the next equal 
time interval, the planet continued to move along this line, it would end up at 
P3, where the length of P; Px equals the length of P2P3. This would imply that 
the triangle $ P; P2 has the same area as the triangle SP) P3 (since they have equal 
bases, and the same height)—this just says that Kepler’s law holds in the special 
case where the force ts 0. 

Now suppose (Figure 2b) that at the moment the planet arrives at P2 it experi- 
ences a force exerted along the line from S to Pz, which by itself would cause the planet 
to move to the point Q. Combined with the motion that the planet already has, 
this causes the planet to move to R, the vertex opposite P2 m the parallelogram 
whose sides are P)P3 and P)Q. 

Thus, the area swept out in the second time interval is actually the triangle 
SPR. But the area of triangle $P)R is equal to the area of triangle SP3 P2, since 
they have the same base SP, and the same heights (since RP3 is parallel to SP). 
Hence, finally, the area of triangle SP)R is the same as the area of the original 
triangle SP; P2! Conversely, if the triangle SRP has the same area as SP; P2, and 
hence the same area as S P3P2, then RP3 must be parallel to SP2, and this implies 
that Q must lie along SP). 

Of course, this isn’t quite the sort of argument one would expect to find in a 
modern book, but in its own charming way it shows physically just why the result 
should be true. 


To analyze planetary motion we will be using the material in the Appendix to 
Chapter 12, and the “determimant” det defined in Problem 4 of Appendix 1 to 
Chapter 4. We describe the motion of the planet by the parameterized curve 


C(t) = FiAcos ee), sure (7), 


so that r always gives the length of the line from the sun to the planet, while 6 
gives the angle, which we will assume is increasing (the case where @ is decreasing 
then follows easily). It will be convenient to write this also as 


(1) c(t) = r(t)- e(O(t)), 


where 
e(f) = (cos sim ft) 


is Just the parameterized curve that runs along the unit circle. Note that 


e’(t) = (—sinf#, cost) 
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c(t) 
c(0) 
FIGURE 3 
c(t +h) 
c(t) 
O 
FIGURE 4 


is also a vector of unit length, but perpendicular to e(7), and that we also have 
(2) det(e(t), e(t)) = 1. 

Differentiating (1), using the formulas on page 247, we obtain 
(3) et) =e) rae Grae Gm) 


and combining with (1), together with the formulas in Problem 6 of Appendix 1 
to Chapter 4, we get 


r(tyr’(t) det(e(O(t)), e(A(t))) + r(1)70'(t) det(e(@(r)), e’((r))) 
= r(t)"6'(t) det(e(O(1)), e(A())), 


det(c(t), hc) 


since det(v, v) is always 0. Using (2) we then get 
B 
(4) dettercaies r-0". 
F > : ; x 
As we will see, 7-0’ turns out to have another important interpretation. 


Suppose that A(t) is the area swept out from time 0 to t (Figure 3). We want 
to get a formula for A‘(t), and, in the spirit of Newton, we'll begin by making 
an educated guess. Figure 4 shows A(t +) — A(t), together with a straight line 
segment between c(t) and c(t +h). It is easy to write down a formula for the area 
of the triangle A(A) with vertices O, c(t), and c(t + 4): according to Problems 4 
and 5 of Appendix | to Chapter 4, the area is 


area(A(h)) = 5 det (c(t), c(t th) —- c@). 


Since the triangle A(/) has practically the same area as the region A(t+/) — A(0), 
this shows (or practically shows) that 


A(t +h) — A(t) 


A’(t) = lim 
h—0 h 
anea ACA) 

shin 

h—-0 h 

t+h)—c(t 

= ! det (co lim eee) 

a h>0 h 


= 4 det(c(1), c'(1). 


A rigorous derivation, establishing more in the process, can be made using Prob- 
lem 13-24, which gives a formula for the area of a region determined by the graph 
of a function in polar coordinates. According to this Problem, we cai write 


A(t) . 
(x) AG) = 5f p(y dp 


A(0) 


_ 


if our parameterized curve c(t) = r(t) - e(@(r)) is the graph of the function p in 
polar coordinates (here we’ve used @ for the angular polar coordinate, to avoid 
confusion with the function 6 used to describe the curve c). 
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Now the function p is just 

Oro eo! 
[for any particular angle ¢, @~'(@) is the time at which the curve c has angular 
polar coordinate @, so r(@~'(r)) is the radius coordinate corresponding to ]. 
Although the presence of the inverse function might look a bit forbidding, it’s 
actually quite innocent: Applying the First Fundamental Theorem of Calculus 
and the Chain Rule to (*) we immediately get 

! 1 2 / 
A'(t) = Ep O(n)? 0) 
) . = 
= },(t)"9'(t), since p =ro@-!, 


Briefly, 


A’ = sr 6". 


5 det(c, c= 517". 


Now we're ready to consider Kepler's second law. Notice that Aepler’s second law 
is equivalent to saying that A’ is constant, and thus it is equivalent to A” = 0. But 


A" = 5[det(c. ei) = 5 det(c’,c’) + 5 det(c, Cc) (see page 248) 


det(c,c’”). 


Ni bl 


Kepler’s second law is equivalent to det(c, c”) = 0. 


Putting this all together we have: 


Kepler’s second law is true if and only if the force 1s central, and in this case each 
planetary path c(t) = r(t) - e(@(1)) satisfies the equation 


A 
(K>) r-0' = det(¢,c ))= Gonstant. 


Saving that the force is central just means that it always pomts along c(t). Since 
/ S P, I S 
c(t) is in the direction of the force, that is equivalent to saying that c(t) always 
points along c(t). And this is equivalent to saying that we always have 


det(c,c¢ )=0. 
We’ve just seen that this is equivalent to Kepler's second law. 


Moreover, this equation implies that [det(c, a) = 0, which by (5) gives (K2). J 
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THEOREM 2 


PROOF 


c(t) 


FIGURE 5 


Newton next showed that if the gravitational force of the sun is a central force 
and also satisfies an inverse square law, then the path of any object in it will be a 
conic section having the sun at one focus. Planets, of course, correspond to the 
case where the conic section 1s an ellipse, and this is also true for comets that visit 
the sun periodically; parabolas and hyperbolas represent objects that come from 
outside the solar system, and eventually continue on their merry way back outside 
the system. 


If the gravitational force of the sun is a central force that satisfies an inverse square 
law, then the path of any body in it will be a conic section having the sun at one 
focus (more precisely, either an ellipse, parabola, or one branch of an hyperbola). 


Notice that our conclusion specifies the shape of the path, not a particular param- 
eterization. But this parameterization is essentially determined by Theorem 1: the 
hypothesis of a central force implies that the area A(t) (Figure 5) is proportional 
to t, so determining c(t) 1s essentially equivalent to determining A for arbitrary 
points on the ellipse. Unfortunately, the areas of such segments cannot be deter- 
mined explicitly.* ‘This means that we have to determine the shape of the path 
c(t) = r(t)- e(@(t)) without finding its parameterization! Since it is the function 
ro@7! which actually describes the shape of the path in polar coordinates, we 
shouldn’t be surprised to find 6~! entering into the proof. 
By Theorem 1, the hypothesis of a central force implies that 


(K>) r°6' = det(c.c’) = M 


for some constant M. The hypothesis of an inverse square law can be written 


(j= A(t 
(*) c(t) 7: e(A(1)) 
for some constant H. Using (K2), this can be written 
ys 9) 
=—-—e : 
0’(t) M . 


Notice that the left-hand side of this equation is 
[c’ 067 !]/(@(t)). 


So if we let 
D=c'o gz! 


(this is the main trick --“we consider c’ as a function of 6”), then the equation can 
be written as 


D'( t ee ( (t)) = —-—/( (1) ()) 
0 — e(@ — "0S A 5 Stull 5 
! M M oe 


* More precisely, we can’t write down a solution in terms of familiar “standard functions,” lke sin, 
arcsin, ete. 
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and we can write this simply as 
; 15 H eh. 
D'(u) = — —(cosu,smu) = | — — cosu, — —sinu 
M M M 


[for all w of the form @(t) for some t], completely eliminating 6. 
‘The equation that we have just obtained is simply a pair of equations, for the 
components of D, each of which we can easily solve individually; we thus find that 


H -sinu H -cosu 


for two constants A and B. Letting u = 6(t) again we thus have an explicit formula 


; M4 ano 
c = | ——— . —— . 
—-M M 


forc: 


[Here sin 6 really stands for sino@, etc., abbreviations that we will use throughout. 
Although we can’t get an explicit formula for c itself. if we substitute this equa- 
tion, together with ¢ = r(cos 6, sin @), into the equation 


det(cca )— uu (equation (K2)), 
we get 


H > H5 , 
r|—cos°-6+ Bcosé + —sin° 6 —Asiné| = M, 
M M 


which simplifies to 


H B Ai 
r ~ + —cos@é — —sin@|= 1. 
M- M M 


Problem 15-8 shows that this can be written in the form 
H 
Te) 77} + Ccos(@(t) + D)| = 1, 


for some constants C and D. We can let D = 0, since this simply amounts to 
rotating our polar coordinate system (choosing which ray corresponds to 6 = 0), 
so we can write, finally, 


But this is the formula for a conic section derived in Appendix 3 of Chapter 4 
(together with Problems 5, 6, and 7 of that Appendix). ff 
In terms of the constant M in the equation 
2) 
la wd 
and the constant A in the equation of the orbit 


rl é cose] = A 
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the last equation in our proof shows that we can rewrite (+) as 


> 


(3k) “t)=-—- e(A(r)). 
: A r(t) 


Recall (page 87) that the major axis a of the ellipse is given by 
. A 
(a) S ———, 
1 —¢- 
while the minor axis is given by 
(b) b= 
Consequently, 


(Cc) —— =; 


Remember that equation (5) gives 


A'(t)'= 


bhoj— 
SD 
II 

Nie 


and thus 
A(t) = 5Mt. 


We can therefore interpret M in terms of the period T of the orbit. This period T 
is, by definition, the value of ¢ for which we have @(t) = 27, so that we obtain the 


complete ellipse. Hence 
area of the ellipse = A(T) = IMT, 


or 


2(area of the ellipse 2ral 
#(area ok tic ellipse) = mee by Problem 13-17. 
T is , 


Hence the constant M*/A in (x) 1s 
M? 4na7b? 
A T2A 


= — using (¢). 


This completes the final step of Newton’s analysis: 


THEOREM 3 __ Kepler’s third law is true if and only if the accelerations c” (1) of the various planets, 


moving on ellipses, satisfy 
” “y l 
c(t) = —G- —e(0(t)) 
r+ 


for a constant G that does not depend on the planet. 


THEOREM 4 


PROOF 
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It should be mentioned that the converse of ‘Theorem 2 is also true. ‘To prove 
this, we first want to establish one further consequence of Kepler’s second law. 
Recall that for 


(a) = (cost, sin f) 
we have 
e’(t) = (—sint#, cost). 
Consequently, 
evn (— cost, — sing, eG 
Now differentiating (3) gives 


(=r (t) e6@) +70 EO) 
+ r/(t)O'(t)-e' (A(t) + r(t)O"(t) - (A(t) + (XO (H)O'(t) - e(O(0)). 
Using e”(t) = =e(¢) we get 
c= [r") = (NO) ] -e@@) + [2r' (HE'D + r(O"(D] - (O(). 


Since Kepler’s second law imphes central forces, hence that c(t) is always a mul- 
tiple of c(t), and thus always a multiple of e(6(t)), the coefficient of e’(6(t)) 
must be O [as a matter of fact, we can see this directly by taking the derivative of 
formula (K2)|. Thus Kepler’s second law mmphes that 


(6) c(t) = [r"(D) — r(0'()7] - e(6(0)). 


If the path of a planet moving under a central gravitational force lies on a conic 
section with the sun as focus, then the force must satisfy an inverse square law. 


As in Theorem 2, notice that the hypothesis on the shape of the path, together 
with the hypothesis of a central force, which is equivalent to Kepler’s second law, 
essentially determines the parameterization. But we can’t write down an explicit 
solution, so we have to obtain information about the acceleration without actually 
knowing what it 1s. 

Once again, the hypothesis of a central force imphes that 


(K2) ro’ = M, 


for some constant M, and the hypothesis that the path les on a conic section with 
the sun as focus implies that it satisfies the equation 


(A) r[l1+ecosé] =A, 


for some € and A. For our (not especially illumimating) proof, we will keep differ- 
entiating and substituting from these two equations. 
First we differentiate (A) to obtain 


r'[1+ecos@] —eré’ sind = 0. 
Multiplymg by r this becomes 


rr’{1 + ecos6] — er*6’siné = 0. 
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Using both (A) and (K>), this becomes 
Ar’ —eMsin@ = 0. 
Differentiating again, we get 
Ar” — eM@' cos6 = 0. 


Using (K2) we get 


5 


“ eM- 
Ar’ — — cosé = 0, 
Ry. 
and then using (A) we get 
no ME 
Ar” — —|——1]=0 
[c- |F 
Substituting from (K2) yet again, we get 
5 
” 2 M- 
Al{r° —r(@')"] + — = 0, 
r- 
or 
2 
PES 
r” at r(é ir i =. 
ATs 
Comparing with (6), we obtain 
2 
et) ==—e(6 (8), 
PH) 
which is precisely what we wanted to show: the force is inversely proportional to 


the square of the distance from the sun to the planet. J 


THE LOGARITHM AND 
CHAPTER EXPONENTIAL FUNCTIONS 


In Chapter 15 the mtegral provided a rigorous formulation for a preliminary def- 
mition of the functions sm and cos. In this chapter the integral plays a more 
essential role. For certain functions even a prelimmary definition presents difficul- 
ties. For example, consider the function 


toa = 10") 
This function 1s assumed to be defined for all x and to have an inverse function, 
defined for positive x, which is the “logarithm to the base 10,” 
(aaea log jy x. 


In algebra, 10° is usually defined only for rational x, while the definition for ir- 
rational x is quietly ignored. A brief review of the definition for rational x will 
not only explain this omission, but also recall an important principle behind the 
definition of 10°. 

The symbol 10" is first defined for natural numbers n. ‘This notation turns out 
to be extremely convenient, especially for multiplying very large numbers, because 


10” ’ Oe _ Ose 


The extension of the definition of 10* to rational x is motivated by the desire 
to preserve this equation; this requirement actually forces upon us the customary 
definition. Since we want the equation 


10° oe 10°" = 10! 
to be true, we must define 10° = 1; since we want the equation 
10-"- 10” = 10° = | 


to be true, we must define 107” = 1/10”; since we want the equation 


10!/" : ; 10!/" =, 1Q W/ate+1/n a, 10! = 10 
n times n times 


to be true, we must define 10!/” = Y10; and since we want the equation 


10!/" aad 19!/" tt) T/nt--+1/n = 1Q"/" 
—_—__—_ —_—_—— 
m times m times 


to be true, we must define 10”/” = (10 ine 

Unfortunately, at this pomt the program comes to a dead halt. We have been 
guided by the principle that 10* should be defined so as to ensure that 10°** = 
10°10"; but this prmeiple does not suggest any simple algebraic way of defining 


o39 
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10* for irrational x. For this reason we will try some more sophisticated ways of 
finding a function f such that 


(*) J @+9) = f(x): f(y) for allSjand y. 


Of course, we are interested in a function which is not always zero, so we might 
add the condition f(1) 4 0. If we add the more specific condition f(1) = 10, 
then (*) will imply that f(+) = 10° for rational x, and 10° could be defined as f(x) 
for other x; in general f(x) will equal [ f(1)]* for rational x. 

One way to find such a function is suggested if we try to solve an apparently 
more difficult problem: find a differentiable function f such that 


f@+y)= f@)- f(y) for all x and y, 


Assuming that such a function exists, we can try to find f’ knowing the derivative 
of f might provide a clue to the definition of f itself. Now 


x+h)— f(x 
f(x) = lim La tala 
h—0 h 


= lim Hes, ‘ fh) a f(x) 
 h0 h 
h)-1 
— /ouaniten es 
h>0 h 
‘The answer thus depends on 
h)-1 

ma00) simi Eh eae 


h—0 h 
for the moment assume this limit exists, and denote it by a. ‘Then 
i (@)=a- f(x) tor alle 
Even if a@ could be computed, this approach seems self-defeating. The derivative 
of f has been expressed in terms of f again. 


If we examine the inverse function f~! = log;,. the whole situation appears in 
a new light: 


I 


log 19 (x) = PEND 


a-f(7 ) Bax 

The derivative of f7~! is about as simple as one could ask! And, what ts even 
b 

more interesting, of all the integrals / x" dx examined previously, the mtegral 
a 

b 
i x! dx is the only one which we cannot evaluate. Since log;,) 1 = 0 we should 
a 


have 


| ee 
al a dt = log yy x = log yo l = log 19 Xs 
l 


DEFINITION 
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x 
This suggests that we define log; x as cj) f t~' dt. The difficulty is that @ is 
] 


unknown. One way of evading this difficulty is to define 


av 1 
logx = | ai. 
; a 


and hope that this integral will be the logarithm to some base, which might be 
determined later. In any case, the function defined in this way is surely more 
reasonable, from a mathematical point of view, than log;g. The usefulness of 
log;g depends on the important role of the number 10 in arabic notation (and thus 
ultimately on the fact that we have ten fingers), while the function log provides a 
notation for an extremely simple integral which cannot be evaluated in terms of 
any functions already known to us. 


li x > 0; then 


The graph of log is shown m Figure |. Notice that if x > 1, then logx > 0, 
and if 0 < x < }, then log x < 0, since, by our conventions, 


x] ] 1 
/ wd =~ | —dt <0. 
jee x f 


For x < 0, a number log x cannot be defined in this way, because f(t) = I/f 1s 
not bounded on [x, I]. 


fiy= + 
fi) == 


area = log x 
S log 


FIGURE 1 


The justification for the notation “log” comes from the folowing theorem. 


ey S Of then 


log(xy) = log x + log y. 
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PROOF Notice first that log’(x) = 1/x, by the Fundamental Theorem of Calculus. Now 
choose a number y > O and let 


f(x) = log(xy). 
Then 
] 


f'(x) = log’ (xy)- y= —- y=-. 
xy x 


Thus f’ = log’. This means that there is a number c such that 
f(x) =logx+ec forall x > 0, 


that is, 
log(xy)=logx+c forall x > 0. 


The number c¢ can be evaluated by noting that when x = | we obtain 


log(1-y) =log]1 +c 
=e: 


Thus 
log(xy)=logx+logy forall x. 


Since this is true for all y > 0, the theorem is proved. § 


COROLLARY 1 If n is a natural number and x > O, then 


log(x") = nlog x. 
PROOF Let to you (use induction). § 


COROLLARY 2 If x, y > 0, then 
x 
log (~) = log x — log y. 
y 


PROOF ‘This follows from the equations 
x x 
log x = log (= v) = log (=) +logy. J 
y sey 


Theorem | provides some important information about the graph of log. ‘The 
function log is clearly increasing, but since log’(x) = 1/x, the derivative becomes 
very small as x becomes large, and log consequently grows more and more slowly. 
It is not immediately clear whether log is bounded or unbounded on R. Observe, 
however, that for a natural number n, 


log(2") =nlog2 (and log2 > 0); 


it follows that log is, in fact, not bounded above. Similarly, 


l 
log (= = log ] — log 2" ——n log 2% 


— 
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therefore log is not bounded below on (0,1). Since log is continuous, it actu- 

2 : ‘ : a jee 
ally takes on all values. Therefore R is the domain of the function log ©. ‘This 
important function has a special name, whose appropriateness will soon become 
clear. 


The graph of exp is shown in Figure 2. Since log x is defined only for x > 0, we 
always have exp(x) > 0. The derivative of the function exp is easy to determine. 


For all numbers x, 
exp (9) = exp(2). 


eplore (oy) (= ——=— 
log (log (x)) 
l 
] 


log! (x) 
= log! (x) = ex) al 


A second important property of exp is an easy consequence of Theorem 1. 


If x and y are any two numbers, then 


exp(x + y) = exp(x) - expty). 


[bet xe=\exp@ randy t= expG)) so that 


/ 
SS Oe ae 
/ 
oly 


Then 


x+y=logx’+logy = log(x’y’). 


This means that 


exp(x + y) = x’y’ = exp(x) - exp(y). I 


This theorem, and the discussion at the beginning of this chapter, suggest that 
exp(1) is particularly nmportant. There is, in fact, a special symbol for this number. 


| e = exp(1). 
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‘This definition is equivalent to the equation 


e | 
I =loge = [ — dt. 
z ob 


As illustrated in Figure 3, 
? 


ca | 
ii —dt <1, since |- (2 — 1) is an upper sum for 
! 


: FO 1/ton [1 2); 
and 
ain 
i) —dt> 1, since 5 -(2—1)+ i - (4 —2) = 1 1s a lower 
by sum for f(t) = 1/t on [1,4]. 
FIGURE 3 ‘Thus 


which shows that 


2<e <4. 


In Chapter 20 we will find much better approximations for e, and also prove that 
é is irrational (the proof is much easier than the proof that z 1s irrational!). 


As we remarked at the beginning of the chapter, the equation 
exp(x + y) = exp(x) - exp(y) 
implies that 


[exp(1)]* 


e’, for all rational x. 


exp(x) 


Since exp is defined for all x and exp(x) = e* for rational x, it is consistent with 
our earlier use of the exponential notation to define e* as exp(x) for all x. 


DEFINITION For any number x, 
Z 


e” = exp(x). 


The terminology “exponential function” should now be clear. We have suc- 
ceeded in defining e* for an arbitrary (even irrational) exponent x. We have not 
yet defined a*, if a # e, but there is a reasonable principle to guide us in the 
attempt. If x is rational, then 

a= (e845 = et loga. 


But the last expression is defined for al/ x, so we can use it to define a*. 


DEFINITION 
ia) = OF 
x \ | 
\ \Fe) = (47 
~~ \ 
ANS a ec) — Ca 
f(x) = (;) CaN 
; ~ AC) =a 
SS 
eae 
FIGURE 4 
THEOREM 4 
PROOF 
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If a > O, then, for any real number +x, 


“= eX loga 


(If a = e this definition clearly agrees with the previous one.) 


‘The requirement a > O is necessary, m order that log a be defined. ‘This is not 
unduly restrictive since, for example, we would not even expect 


ag ple ue ass 


to be defined. (Of course, for certain rational x, the symbol a* will make sense, 
according to the old definition; for example, 


(-1!'P8 = Y-1 =-1)) 


Our definition of a* was designed to ensure that 


(2) axeey aw for allan yy" 


As we would hope, this equation turns out to be true when e is replaced by any 
number a > Q. ‘The proof is a moderately mvolved unraveling of termmology. At 
the same time we will prove the other important properties of a*. 


[edema Oathion 


(De (eyo tor allows 


(Notice that a? will automatically be positive, so (a°)° will be defined); 


xX vi 


(2) al =aanda*t’ =a*-a’ forall x, y. 
(Notice that (2) imphes that this definition of a* agrees with the old one for all 
rational x.) 


»ploga i 5 : 
et log(e = ec (bloga) = ecb loga = a’. 


: anon bs 
(1) (a’)’ = et loga = 
(Each of the steps in this string of equalities depends upon our last definition, or 
the fact that exp = log!) 


(2) a! = el loga = eloga en 


cae eitty) loga ok loga+yloga eX loga , ev loga 


a Seek) | 


Figure 4 shows the graphs of f(x) = a* for several different a. The behavior 
offthestunetionsdependswonewhtther:d < 1, a = |, orma > |. If a = by then 
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f(x) = logyy x 


f@)=log,x@< 1) 


f(®)= Fo= 1. Supposera aly Imethisecase logae=n0 wal hus, 


if xe ye 
then xloga < yloga, 
a et loga Zi loga’ 
1.€., er 


Thus the function f(x) = a* is increasing. On the other hand, if 0 <a < 1, 
so that loga < 0, the same sort of reasoning shows that the function f(x) = a* 
is decreasing. In either case, if a > 0 and a # I, then f(x) = a’ is one-one. 
Since exp takes on every positive value it is also easy to see that a* takes on every 
positive value. ‘Thus the inverse function is defined for all positive numbers, and 
takes on all values. If f(x) =a’, then f~! is the function usually denoted by log, 
(Figure 5). 

Just as a* can be expressed in terms of exp, so log, can be expressed in terms 
of log. Indeed, 


if y = log, x. 
then = a> = e ee 
so logx = yloga, 
log x 
or = S : 
loga 
In other words, 
log x 
log, 2 ; 
loga 


The derivatives of f(x) = a* and g(x) = log, x are both easy to find: 


f= e loga et f'(x) = log a-e loga _ log ea 


log x ; 
gixy=—,  sog’(x) = 


log a xloga’ 


A more comphcated function like 
, -\h(x) 
f(x) = g(x) 
is also easy to differentiate, if you remember that, by definition, 


f (x) = el ®) log g(x). 


it follows from the Chain Rule that 


f Pap eee. fW') log g(x) + h(x) “2 
g(x) 


toga). fie log g(x) + Rene = | 
g(x) 


‘There is no point in remembering this formula —simply apply the principle behind 
it 1 any specific case that arises; it does help, however, to remeniber that the first 


factor in the derivative will be g(x)". 
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There is one special case of the above formula which 7s worth remembering. 
The function f(x) = x” was previously defined only for rational a. We can now 
define and find the derivative of the function f(x) = x* for any number a; the 
result is just what we would expect: 


f(x) L ,~¢ = 64 log x 


sO 
a a n 
f'(x) ee. logx _ x? = ax! 1 
x 2 
Algebraic manipulations with the exponential functions will become second na- 
ture after a little practice—just remember that all the rules which ought to work 


actually do. ‘The basic properties of exp are still those stated in ‘Theorems 2 and 3: 


exp’ (x) = sexp(X), 
exp(x a5 yi exp(x) -exp(y). 


- 


In fact, each of these properties comes close to characterizing the function exp. 
Naturally, exp is not the only function f satisfying f’ = f, for if f = ce*, then 
f'(x) = ce* = f(x); these functions are the only ones with this property, however. 


If f is differentiable and 
f'tx) = f(x) for all x, 
then there is a number c such that 


Foy=ce forallx. 


Let 


TOS 


e* 


g(x) = 
(This is permissible, since e* ¥ 0 for all x.) ‘Then 


S e* f'(x) — f(x)e* 


fe 
g (x) = 0. 
(e*)? 
Therefore there is a number c such that 
x 
fac IO ! —/e Bleeriile, || 
e* 


The second basic property of exp requires a more involved discussion. ‘The 
function exp ts clearly not the only function f which satisfies 


F(x fy) =f) 0). 


In fact, f(x) = 0 or any function of the form f(x) = a” also satisfies this equation. 
But the true story is much more complex than this —there are infinitely many other 
functions which satisfy this property, but it is impossible, without appealing to more 
advanced mathematics, to prove that there is even one function other than those 
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HGH EG 


THEOREM 6 


PROOF 


already mentioned! It is for this reason that the definition of 10* is so difficult: 
there are infinitely many functions f which satisfy 


J tey) = f ) FO). 
Fl) = 10, 


but which are not the function f(x) = 10*! One thmg is true however—any 
continuous function f satisfying 


Cg ee) ya) 


must be of the form f(x) = a* or f(x) = 0. (Problem 38 indicates the way to 
prove this, and also has a few words to say about discontinuous functions with this 
property.) 

In addition to the two basic properties stated in Theorems 2 and 3, the function 
exp has one further property which is very important--exp “grows faster than any 
polynomial.” In other words, 


For any natural number n, 
G 


"aC 
hm — =o. 
x00 x" 


‘The proof consists of several steps. 


Step 1. e* > x for all x, and consequently hm e* = oo (this may be considered 
5 — N82) 
to be the case n = 0). 
‘To prove this statement (which is clear for x < 0) it suffices to show that 


x>logx forall x > 0. 


If x < I this is clearly true, since logx < 0. If x > 1, then (Figure 6) x — 1 is an 
ipper sum for,/ @) ay won| | xii so lox =< ~— 1 = x: 


3, 


ae ane 
Sip.2, ling — seer 
X—0O X 


‘To prove this, note that 


N| x 
i) 


. . . . ap . 
By Step 1, the expression in parentheses is greater than |, and lim e*/* = oo; this 
X—> OO 


shows that lim e*/x = oo. 
I= 00 
a 
Step 3. lim = 0.0), 
x00 x! 


Note that 
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The expression in parentheses becomes arbitrarily large, by Step 2, so the nth 
power certainly becomes arbitrarily large. § 


It is now possible to examine carefully the following very interesting function: 
f(x) =e7'/*" x £0. We have 


12 
fase ss. 


~ 


‘Therefore. 


f'(x) <0 forx <0, 
¥(x) > fora > 0; 


so f is decreasing for negative x and increasing for positive x. Moreover, if |x| is 
i.» De x 2: ° 
large, then x? is large, so —1/x? is close to 0, so e~!/*” is close to 1 (Figure 7). 


FIGURE 7 


rh . . . . . . . oy 5) 
The behavior of f near 0 is more interesting. If x is small, then 1/x~ is large, 
I/<" is large, so e!/* = 1/(e!/") is small. This argument, suitably stated with 


e’s and 4’s, shows that 


so e 
: ie 
lim e!/* =0. 
x—0 
Therefore, if we define 
go, xA0 
0, x:='0: 


then the function f is continuous (Figure 8). In fact, f is actually differentiable 


f(xy= 


FIGURE 8 
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at 0: Indeed 


, aves L/h 
f (O) = lim = lim —;, 
h-O h h—0 e! I/h)- 
and 
: L/h : x : } l/h : 
lim —— = lim —., while lim - = — lim . 
? 
h—0* ef l/h) x00 efx") h—0- e(l/hy x00 eft") 
We already know that 
x ee 
lim — = oo; 
r00 X 
it is all the more true that 
ei) 
lim = 00; 
i900 X 
and this means that 
; x 
lim —— = 0. 
X—>0O em) 
‘Thus 
1 /x? : 0 
./ il alll ee * 
f@)= 3 7 
0, x =0 
We can now compute that 
/ od 2 
, ». £ (h) = f°) 
f" (0) = im ————— 
h—0 h 
@) 
ee. = 
e h3 
= lim 
h—O0 h 
l 
; eee 
. 2% —1/h> © A Ix4 
= lim ee lim — = lim =; 
h—0 h h>0 el/he r00 elt") 


an argument similar to the one above shows that f”(O) = 0. Thus 


i TO —1/x? 4 ; 
f'@y= ot ala . “x6? ae 


0, x= 0; 


This argument can be continued. In fact, using induction it can be shown (Prob- 
lem 40) that f“(0) = O for every k. The function f is extremely flat at 0, and 
approaches 0 so quickly that it can mask many irregularities of other functions. 


For example (Figure 9), suppose that 
—1/x? 


| 

-sn—, x #0 
x 

(), x= 0) 


je 2 
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It can be shown (Problem 41) that for this function it is also true that f (0) = 0 
for all k. ‘Vhis example shows, perhaps more strikingly than any other, just how 
bad a function can be, and still be infinitely differentiable. In Part IV we will 
investigate even more restrictive conditions on a function, which will finally rule 
out behavior of this sort. 


a 
rae 


AC.) = | 


eV sn i/x, x £0 
0, 50 


TG, CORSE, 


PROBLEMS 


1. Differentiate each of the following functions (remember that a” always de- 


notes a), 


i) fase”. 
(1) ne) = log(1 + log(1 Se log(1 ali. eltelt yyy 
an Ge 6 1 ae 
( 
( 
( 


x 2 
Cu dt) 


Iv) «fla = el ; 
eo) = Gua). 
nye f(x) =Hog 4 sim x. 

aN log(sin e*) 
vile ei Jaresin ( 2 ) . 


six 


(viii) f(x) = (log(3 + e*))e* + (aresin xy°83, 
Gy Can — (lor ce” 

a ee ee ae 

( 


x1) 8) — se ye 
2. (a) Check that the derivative of logof is f’/f. 


This expression is called the logarithmic derivative of f. It is often easier 
to compute than f’, since products and powers m the expression for f 
become sums and products in the expression for logo f. The deriva- 
tive f’ can then be recovered simply by multiplying by f; this process is 
called logarithmic differentiaton. 


(b) Use logarithmic differentiation to find f’(x) for each of the following: 


i) f(xyy=(4xy(l +e"). 
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Girx) Fx 
(i —x)(34+x)23° 
(iii) f(x) = (sin.x)°S* + (cos.x)™*. 


oa 


fi) f@)= 


e*—e 


(1v) SX) = eal pe) 


3. Find 
b f(t ) 


—— dt 
, Fa 


(for f > 0 on fa, b}). 
4. Graph each of the following functions. 


(a) f(x) =et!, 

(b) f(x) = em 

(c) fa) =e Fe“ | (Compare the graph with the graphs of exp and 
( 


d) f(x) =e*—e~*.J 1/exp.) 
e—er en ef 2 
SI = yee ee ee 


5. Find the following limits by Hopital’s Rule. 


(i) lm 


x0 x* 
; He Pe x72 — <7 6 
(i) lm ; 
x 0 i 
BR fy x? /2 
P| 
(uu) 46 x3 
: log(1 +x)—x+x7/2 
(iv) 1 ACES : x +x°/ : 
x0 S Row 
Poms, | = x? , 
(v) lim CET eh, 
x0 x- 
. .. log +x) —x 4+x?7/2 — x3/3 
(v1) lim, Soe 
a> x: 


6. Find the following limits by  Hopital’s Rule. 
(1) lim (1 — x)! /*, 
(ii) lim (tanx)"" = 


x 7 


(iii) lim(cosx)!/*", 
x0 


(x. y)ix2+y2?=1} 
(Gos xesin x) 


(cosh x, sinh x) 


(b) 
FIGURE 10 


He 
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‘The functions 


al e* —e 
smh x = ———.,, 
2 
e* +e % 
eosinx = 
2 
e* —e* 5 
tanh x = ———— = ] — ee 
ex +e e-* + | 


are called the hyperbolic sine, hyperbolic cosine, and hyperbolic 
tangent, respectively (but usually read ‘smch,’ ‘cosh,’ and ‘tanch’). There 
are many analogies between these functions and their ordinary trigonometric 
counterparts. One analogy is illustrated in Figure 10; a proof that the region 
shown in Figure 10(b) really has area x/2 is best deferred until the next chap- 
ter, when we will develop methods of computing integrals. Other analogies 
are discussed in the following three problems, but the deepest analogies must 
wait until Chapter 27. If you have not already done Problem 4, graph the 
functions sinh, cosh, and tanh. 


Prove that 


12: 


Y ) 
a) cosh” — smh* 


b) tanh? +1/ cosh? = 1. 


c) smh(x + y) = sinhx cosh y + cosh x sinh y. 
sil) — cosh, 


l 


7 
cosh 


( 
( 
( 
(d) cosh(x + y) = cosh x cosh y + sinh x sinh y. 
( 
( 
( 


) 
) gosh = sinh, 
) 


tanh’ 


The functions sinh and tanh are one-one; their inverses sinh7! and tanh~!, 
are defined on R and (—1, 1), respectively. ‘These inverse functions are some- 
times denoted by arg sinh and arg tanh (the “argument” of the hyperbolic 
sine and tangent). If cosh is restricted to [0, 00) it has an inverse, denoted 
by arg cosh, or simply cosh |, which is defined on [I, co). Prove, using the 
information m Problem 8, that 


(a) sinh(cosh7! x) = Vx? — I. 
(b) cosh(sinh~! x) = V1 +.x?. 
| 
(c) Gine | Cak= =. 
Vl4+x2 
I 
(d) (cosh7!)'(x) = ———-._ forx > |. 
x7 — 1} 
jot asin 7 | 
(e)) (amis 5) G3) =~ for |x| < I. 


] = ye 
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10. 


11. 


12. 


13. 


14. 
15. 


16. 


= pieces 3 Aa | 3 = F 
(a) Find an explicit formula for sinh“, cosh r and tanh”! (by solving the 
; eal j 
equation y = sinh ~ x for x in terms of y, etc.). 
(b) Find 


b 
| 
[S« 
a ¥1+x? 


bh | 


—————= 
a Vx?2-] 


b | 
‘ ——dx for |a|, |b| < 1. 


va | me 


x fora,b>1ora,b< -—l, 


Compare your answer for the third integral with that obtained by writing 


Lame J | 4 | 
fi gee 2% le] 


x ] 
ris i= / dt 
? log t 


is not bounded on [2, 00). 


Show that 


Let f be a nondecreasing function on [1, 00), and define 


FO) = | re at x = 1. 
I 


Prove that f is bounded on [1, 00) ifand only if F/ log is bounded on [1, oo). 
Find 


(a) lim a* for 0 < a < 1. (Remember the definition!) 


X—> OC 
Xx 
(b) lim ———. 
0) x (log.x)” 
; ; ] rx \n 
(c) lm Mega) 
X—> 00 x 


] n 
(—1)" (iog -) 
n x 


(d) lim x(log x)". Hint: x(log x)" = 


% 
(e) lm x’. 
x0" 


Graph f(x) = x* for x > 0. (Use Problem 13(e).) 


(a) Find the minimum value of f(x) = e*/x" for x > 0, and conclude that 
f(x) > e"/n" for x > n. 

(b) Using the expression f’(x) = e*(x — n)/x"+!, prove that f’(x) > 
e"*'/(n + 1)"*! for x > n+ 1, and thus obtain another proof that 
lim: f(%) = 00. 

00 


Graph f(x) = e*/x". 


7. 


18. 
19. 


20. 


21. 


O22) 
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(a) Find lim log(] + y)/y. (You can use PH6pital’s Rule, but that would be 
silly.) 

(b) Find hm x log(1 + 1/x). 

(c) Prové dat a= lim (1 +1/x)*. 

(d) Prove that e? a dim (1+a/x)*. (It is possible to derive this from part (c) 


with just a little algebraic fiddling.) 


aie) Prove that log b= lim x(bl* . ie 
xX—> oOo 


Graph f(x) = (1 + 1/x)* for x > 0. (Use Problem 17(c).) 


If a bank gives a percent interest per annum, then an initial investment / 
yields 1(1 + a/100) after J year. If the bank compounds the interest (counts 
the accrued interest as part of the capital for computing interest the next 
year), then the mitial investment grows to [(1 + 4/100)" after n years. Now 
suppose that interest is given twice a year. ‘The final amount after 2 years 
is, alas, not 1(1 +a/100)?", but merely 1(1 + 4/200)?” —although interest is 
awarded twice as often, the interest must be halved mn each calculation, since 
the interest is a/2 per half year. This amount is larger than /(1 + a/100)", 
but not that much larger. Suppose that the bank now compounds the interest 
continuously, 1.e., the bank considers what the investment would yield when 
compounding k times a year, and then takes the least upper bound of all 
these numbers. How much will an inital investment of | dollar yield after 
I year? 


(a) Let (@) = loc |x|ltor xt 0} Prove thate, OaJ— lore ae) 
(b) If f(x) 40 for all x, prove that (log | f |)’ = f’/f. 


Suppose that on some interval the function f satisfies f’ = cf for some 
number c. 


(a) Assuming that f is never 0, use Problem 20(b) to prove that | f (x)| = /e™ 
for some number / (> 0). It follows that f(+) = ke“ for some k. 

(b) Show that this result holds without the added assumption that f ts 
never 0. Hint: Show that f can’t be 0 at the endpoint of an open 
interval on which it is nowhere 0. 

(c) Give a simpler proof that f(x) = ke™ for some k by considering the 
fincion, °() = {ya 

(d) Suppose that f’ = fg’ for some g. Show that f(x) = ke? for some k. 


A radioactive substance diminishes at a rate proportional to the amount 
present (since all atoms have equal probability of disintegrating, the total 
disintegration is proportional to the number of atoms remaining). If A(r) 
is the amount at time ¢, this means that A’(t) = cA(t) for some ¢ (which 
represents the probability that an atom will disintegrate). 


(a) Find A(t) in terms of the amount Ag = A(Q) present at ume 0. 
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235 


os 


Aa 


26. 


27. 


28. 


29. 


30. 


(b) Show that there is a number t (the “half-life” of the radioactive element) 


with the property that A(t + 1) = A(t)/2. 


Newton’s law of cooling states that an object cools at a rate proportional to 
the difference of its temperature and the temperature of the surrounding 
medium. Find the temperature T(t) of the object at time f, in terms of its 


temperature 79 at ume Q, assuming that the temperature of the surrounding 
medium is kept at a constant, M. Hint: ‘To solve the differential equation 


expressing Newton’s law, remember that 7’ = (T — M)’. 
Xx 
Prove thathirey (7) = / ¥@) desthen f = 9. 
0 


Find all continuous functions f satisfying 


] 
Find all functions f satisfying f’(t) = f(t) +f f(t) dt. 
0 


Find all continuous functions f which satisfy the equation 
i 
lene 


(a) Let f and g be continuous functions on [a, 6] with g nonnegative 
pose that for some C we have 


dt. 


Gita): =| f(y) 


pect f v2. Ome xee b. 


Prove Gronwall’s inequality: 


(3) = Cel. ®. 


. Sup- 


Hint: Consider the derivative of the function h(x) = (C + i. fa)e ip oe 
(b) Let f and g be nonnegative functions with g continuous and f difleren- 
fable. Supposemhar f(x) = g(x) 7 (x) and f (0) =O pProvethamy = 0) 


(Compare Problem 21.) 


(a) Prove that 
x2 DS x" 
l+x+—+—+---+—<e* forx=0. 
2p m3! n! 
Hint: Use induction on 2, and compare derivatives. 
(b) Give a new proof that lim e*/x" = oo. 
: XO 


Give yet another proof of this fact, using the appropriate form of PH6pital’s 


Rule. (See Problem 11-56.) 


31. 


OZ: 
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x 


2 ; 2 ) 
: e’ dt. (You should be able to make an educated 


(a) Evaluate lim e7 
X00 ) 
guess before doing any calculations.) 


(b) Evaluate the following limits. 


ge few] x) F 
Gj) . lim e * | Ge at. 
: 


X00 
2 xt(logx)/x  , 
ai) lim e* | e" dt. 
x00 
x 
safe On)/2x wy 
(iii) lim e* i é dl. 
x00 ; 


‘This problem outlines the classical approach to logarithms and exponentials. 
‘To begin with, we will simply assume that the function f(x) = a*, defined in 
an elementary way for rational x, can somehow be extended to a continuous 
one-one function, obeying the same algebraic rules, on the whole line. (See 
Problem 22-29 for’a direct proof of this.) The inverse of f will then be 


_denoted by log,. 


(a) Show, directly from the definition, that 
h l/h 
log (x) — sla eee 
l fi: L/k 
= —-log i hm(l + %) : 
x k>0 


Thus, the whole problem has been reduced to the determination of 
lim(1 + h)!/". If we can show that this has a limit e, then log (x)= 
h-0 


me logele’ = . and consequently exp = log, | has derivative exp’(x) = 
EXD). 


Lay 
(b) Let a, = {| 1+—) for natural numbers 7. Using the binomial theorem, 
Ae 


show that 


ee l 2 k—1 
a, =2+ —|{]—— ee _ 5 
k! n n n 
k=2 


Conclude that ay < ay+1. 

(c) Using the fact that 1/k! < 1/2*-! for k > 2. show that all a, < 3. Thus, 
the set of numbers {a1, a2. a3,...} 1s bounded, and therefore has a least 
upper bound e. Show that for any ¢ > 0 we have e — a, < € for large 
enough n. 

(d) Ifn <x <n +1, then 


| n 1 x 1 n+] 
Ae Se ce 
n+] x n 
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es 
v 


FIGURE 11 


+35) 


734: 


hae OO X-> —0O 


l xX l Xx 
Conclude that lim ( + =) =e. Also show that lim (1 + -) — es 
x By 


and conclude that lim(] +h)!/" =e. 


h-0 


A point P is moving along a line segment AB of length 10’ while another 
pomt Q moves along an infinite ray (Figure 11). The velocity of P is always 
equal to the distance from P to B (in other words, if P(t) is the position of P 
at time ft, then P’(t) = 10’ — P(t)), while Q moves with constant velocity 
Q'(t) = 10’. The distance traveled by Q after time ¢ is defined to be the 


Napierian logarithm of the distance from P to B at time ¢t. Thus 


10’ = Nap log[10’ — P(t)]. 


This was the definition of logarithms given by Napier (1550-1617) in his 
publication of 1614, Afinfici logarithmonum canonis description (A Description of 
the Wonderful Law of Logarithms); work which was done before the use of 
exponents was invented! The number 107 was chosen because Napier’s ta- 
bles (intended for astronomical and navigational calculations), listed the loga- 
rithms of sines of angles, for which the best possible available tables extended 
to seven decimal places, and Napier wanted to avoid fractions. Prove that 


107 

Nap log x = 107 log —. 
i 

Hint: Use the same trick as in Problem 23 to solve the equation for P. 


(a) Sketch the graph of f(x) = (log x)/x (paying particular attention to the 
behavior near 0 and ov). 

(b) Which is larger, e” or 2°? 

(c) Prove that if 0 < x < 1, or x = e, then the only number y satisfying 
x? = y* is wx but if x > 1, x ~e, then there is preciselyone muimber 
yi satishane y= y"; moreover, if x < e, then y > ¢, andul xe 
then y <e. (Interpret these statements in terms of the graph in part (a)!) 

(d) Prove that if x and y are natural numbers and x” = y*, then x = y or 
fH, y = Soe = 4, y = 2. 

(ec) Show that the set of all pairs (x, y) with x” = y* consists of a curve and 
a straight line which intersect; find the intersection and draw a rough 
sketch. 


**(f) For 1 <x < e let g(x) be the unique number > e with x8 = g(x)". 


Prove that g is differentiable. (It is a good idea to consider separate 
functions, 
log x 


ie) — , U'= w=~@ 


log x 
ee : 2 <aee 


fox) = 


+30; 


36. 


3. 
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and write g in terms of fj and f2. You should be able to show that 


[g(x]? . Were se 


g(x) = 5 


a, — log g(x) Xe 


if you do this part properly.) 


‘This problem uses the material from the Appendix to Chapter 11. 


(a) 
(b) 


(b) 


Prove that exp is convex and log is concave. 
HW 


Prove that if Ye pi = | and all p; > 0, then for all z; > 0 we have 


i=] 
ar Pod Os ees SS DI ai alate Pucn- 


(Use Problem 8 from the Appendix to Chapter 11.) 
Deduce another proof that G, < A, (Problem 2-22). 


Let f be a positive function on [a, b], and let P,, be the partition of [a, b] 
into 7 equal intervals. Use Problem 2-22 to show that 


l 
Lilog f, Py) < log ( ECE P)), 
~\b-a 


b-—a 


Use the Appendix to Chapter 13 to conclude that for all integrable f > 0 


1 b 1 b 
/ log F = log ( / f). 
b—-—a Ja b-—a Ja 


we have 


A more direct approach 1s illustrated in the next part: 


(c) 


In Problem 35, Problem 2-22 was deduced as a special case of the in- 


equality 
Wl n 
g (> na aS S— pig(xi) 
i i=] 


n 
IOP pie ) pi = 1 and g convex. For g concave we have the reverse 
il 


inequality 
n ft 
S- pighi) <8 (> na 
ial =i 


Apply this with g = log to prove the result of part (b) directly for any 
integrable f. 


(d) State a general theorem of which part (b) is just a special case. 


Suppose f satisfies f’ = f and f(x + ») = f(x) f() for all x and y. Prove 
diate t= cxpromeya— 6: 
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*38. 


eo 


*40. 


*41. 


42. 


ea): 


*(c) Prove that if @w is a root of (*) of order r, then y(x) = x 
el ~ 


Prove that if f 1s continuous and f(x + y) = f(x) f(y) for all x and y, then 
either f = 0 or f(x)i="[/ ())* for alle Ent Showthat yi)" 
for rational x, and then use Problem 8-6. ‘This problem is closely related to 
Problem 8-7, and the information mentioned at the end of Problem 8-7 can 
be used to show that there are discontinuous functions f satisfying f(x+y) = 


FfA)FQ). 


Prove that if f is a continuous function defined on the positive real numbers, 
andey (4y) = { (x) Saf (yy for all positime x andeysthen)/je—s0rorey (jr 
f(e) log x for all x > 0. Hint: Consider g(x) = f(e’). 


Prove that if f(x) = ent? fora = 0) and f~(0) = 0) then f®(0) — tet 
all k (you will encounter the same sort of difficulties as in Problem 10-21). 
Hint: Consider functions g(x) = e!/* P(1/x) for a polynomial function P. 


Prove that if f(x) = e7!/ sin 1/x for x #0, and f(0) =0, then f“(0) = 0 
for all k. 


(a) Prove that if @ is a root of the equation 
(k)  @yx" + Go we Ae Ee ea 
then the function y(x) = e satisfies the differential equation 


(ee) any” + a,_py ) +--+ ay’ +aoy = 0. 


*(b) Prove that if @ is a double root of (*), then y(x) = xe also satisfies (*+). 


Hint: Remember that if @ is a double root of a polynomial equation 
T Cox Oatlien of a) = 0, 


k 


gas 


is a solution 
forO<k<7r-— 1. 


If (x) has » real numbers as roots (counting multiplicities), part (c) gives 


n solutions yj,..-, Yn Of (**). 


(d) Prove that in this case the function c)y; +--+ +¢ny¥, also satisfies (*+). 


It is a theorem that in this case these are the only solutions of (**). Prob- 
lem 21 and the next two problems prove special cases of this theorem, 
and the general case is considered in Problem 20-26. In Chapter 27 we 
will see what to do when (x) does not have n real numbers as roots. 


Suppose that f satisfies f” — f =O and f(0) = f’(0) =0. Prove that f =0 
as follows. 
(a) Show that! f* — (f)* = 0. 


(b) Suppose that f(x) 4 0 for all x in some interval (a,b). Show that either 
f(x) =.ee* or alse £(@) = een for all x ini(@, >), for some consianec 


**(c) If f(x) 4 0 for x9 > 0, say, then there would be a number a such that 


0 < @< xpvaid f(a) = 0, while fx) 4 0 for@ < x < x. Why? Use 
this fact and part (b) to deduce a contradiction. 


"44, 


aoe 


*46. 


a7, 
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(a) Showasthat af yisatishies: (— f= 0, then ((4) = ae be for 
some a and b. (First figure out what a and b should be in terms of f(Q) 
and f’(Q), and then use Problem 43.) 


(b) Show also that f = asinh +b cosh for some (other) a and b. 


Find all functions f satisfying 


(a) f™ = lr Oe 
(b) ye) =a) 5 plana 


This problem, a companion to Problem 15-30, outlines a treatment of the ex- 

yonential function starting from the assumption that the differential equation 
5 

f' = f has.a nonzero solution. 


(a) Suppose there is a function f #4 0 with f’ = f. Prove that f(x) 4 0 for 
each x by considering the function g(x) = f(xo + x) f(xo — x), where 


Lio) 

(b) Show that there is a function f satisfying f’ = f and f(0) = 1. 

(c) For this f show that f(x +y) = f(x)- f(y) by considering the function 
g(x) = f(x + y)/fO). 

(d) Prove that f is one-one and that (f~!)/(x) = 1/x. 


Let f and g be continuous functions such that lim f(x) = hm g(x) = oo. 
3 ie lS) x OO 


We say that f grows faster than g (f > g) if 


For example, for any polynomial function P with lim P(x) = oo (i.e., P 
X— 0O 


is non-constant and has positive leading coefficient) we have exp >> P and 
P > log" for any positive integer 7. 


(a) Given f and g, with dm Te) = dim @(%)) = co, is it meaessarily true 
that one of the three conditions ap > 2 org > or f ~ ¢@ holds: 

bl 7 > 2, then (ee 7. 

Komi 


for sufficiently large x, then f > g. 


xX 


ia) ii Ff > 2 and F(x) = | AA = / g, does it necessarily follow 
0 JO 
that F > G? 
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48. 


49. 


(e) Arrange each of the following sets of functions in increasing order of 
growth (for convenience, we indicate each function simply by giving its 
value at x): 


(i) x, e%, x° + log(x>), log 4x, dogx)*, x*, x +e *, x? log x. 


~. ? 2 é 
Gi) xloe~x,e*, loe(x*);.e" , x*; x!8* (log x)*. 


r 


PE 5 5 ay) fo} > 
(i) 8 xe xe, 2 Ee", dog x). 


Suppose that g), g2, g3,... are continuous functions. Show that there is a 
continuous function f which grows faster than each g;. 


Prove that log), 2 is irrational. 


Cia bal E.R 9 INTEGRATION IN ELEMENTARY TERMS 


Every computation of a derivative yields, according to the Second Fundamental 
‘Theorem of Calculus, a formula about integrals. For example, 


if FX) = Cos )'— x then ay leas 


consequently, 


b 
J togxax = FO) — Fla) = bllogd) ~ b — [atloga) ~ a), 0% at? 


a 
Formulas of this sort are simplified considerably if we adopt the notation 


b 
F(x)| = F(b)— F(a). 
We may then write 
b 


b 
it log vadx= x(logx) — x 
a a 
re d 5 ~ ph ° . 
This evaluation of [; log x dx depended on the lucky guess that log is the deriva- 
tive of the function F(x) = x(logx)—x. In general, a function F satisfying F’ = f 
is called a primitive of f. Of course, a continuous function f always has a 


ris f le 


but in this chapter we will try to find a primitive which can be written in terms of 
familiar functions like sin, log, ete. A function which can be written in this way 
is called an clementary function. ‘To be precise,* an elementary function is 


rimitive, namely, 
5 


one which can be obtained by addition, multiplication, division, and composition 
from the rational functions, the trigonometric functions and their inverses, and the 
functions log and exp. 

It should be stated at the very outset that elementary primitives usually cannot 
be found. For example, there is no elementary function F such that 


lL e\= e-* for all x 


(this is not merely a report on the present state of mathematical ignorance; it is 
a (difficult) theorem that no such function exists). And, what is even worse, you 


* The definition which we will give is precise, but not really accurate, or at least not quite standard. 
Usually the elementary functions are defined to include “algebraic” functions, that is, functions g 
satisfying an equation 


(g(x))" + fr-1 (ey! + + fair) = 0, 


where the fj are rational functions. But for our purposes these functions can be ignored. 
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will have no way of knowing whether or not an elementary primitive can be found 
(you will just have to hope that the problems for this chapter contain no misprints). 
Because the search for elementary primitives is so uncertain, finding one is often 
peculiarly satisfying. If we observe that the function 


log(t + x) 
2 


F(x) =x arctan x — 


satisfies 
F’(x) = arctan x 


(just how we would ever be led to such an observation is quite another matter), so 
that 
2 
log(1 + x7) 
) fe 
a 


b 
fl arctan x dx = x arctanx — 
a 


then we may feel that we have “really” evaluated _[ : arctan x dx. 

This chapter consists of little more than methods for finding elementary prim- 
itives of given elementary functions (a process known simply as “integration”), 
together with some notation, abbreviations, and conventions designed to facilitate 
this procedure. This preoccupation with elementary functions can be justified by 
three considerations: 


(1) Integration is a standard topic im calculus, and everyone should know 
about it. 

(2) Every once in a while you might actually need to evaluate an integral, under 
conditions which do not allow you to consult any of the standard integral 
tables (for example, you might take a (physics) course in which you are 
expected to be able to integrate). 

(3) The most useful “methods” of integration are actually very important the- 
oremis (that apply to all functions, not just elementary ones). 


Naturally, the last reason is the crucial one. Ewen if you intend to forget how 
to integrate (and you probably will forget some details the first tme through), you 
must never forget the basic methods. 

These basic methods are theorems which allow us to express primitives of one 
function in terms of primitives of other functions. ‘To begm integrating we will 
therefore need a list of primitives for some functions; such a list can be obtamed 
simply by differentiating various well-known functions. The lst given below makes 
use of a standard symbol which requires some explanation. ‘The symbol 


fy or [ fovas 


means “a primitive of f” or, more precisely, “the collection of all primitives of f.” 
The symbol f f will often by used im stating theorems, while f f(v) dx is most 
useful in formulas hke the following: 


at 

3 x 
xvdx=—. 
ic aX 4 


19. Integration in Elementary Terms 365 


This “equation” means that the function F(x) = x4/4 satisfies F’(x) = x3. It 
cannot be interpreted literally because the right side is a number, not a function, 
but in this one context we will allow such discrepancies; our aim is to make the 
integration process as mechanical as possible, and we will resort to any possible 
device. Another feature of the equation deserves mention. Most people write 


4 
[eas tee 
4 


to emphasize that the primitives of f(x) = x? are precisely the functions of the 
form F(x) = x+/44+C for some number C. Although it is possible (Problem 14) 
to obtain contradictions if this point is disregarded, in practice such difficulties do 
not arise, and concern for this constant is merely an annoyance. 

There is one important convention accompanying this notation: the letter ap- 
pearing on the right side of the equation should match with the letter appearing 
after*the “d "on theleitside——thus 


3 us 
ue du = —, 
4 
/ 1x 
Ux GS == = 


, x12 
(eel = a 


A function in f f(x)dx, ie., a primitive of f, is often called an “indefinite 
integral” of f, while [ z f (x) dx is called, by way of contrast, a “definite integral.” 
This suggestive notation works out quite well in practice, but it is important not to 
be led astray. At the risk of boring you, the following fact 1s emphasized once again: 
the integral he f(x) dx 1s not defined as “F(b) — F(a), where F is an indefinite 
integral of f” (if you do not find this statement repetitious, it is time to reread 
Chapter 13). 

We can verify the formulas in the following short table of indefinite integrals 
simply by differentiating the functions indicated on the right side. 


fe din Gis 


yrtl 
id ; —| 
KX Aly 1 


l l : dx . ; a. 
nae = OOK —dx is often written {| — for convenience; similar 
x Xx x 


abbreviations are used in the last two examples of this 
table.) 
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THEOREM 1 (INTEGRATION BY PARTS) 


[cos ax = Sine 
2 
SEC Xi 1 — an 


Seex tala. = sec x 


| dx 
——~ = arctan x 
ee 


dx : 
——————__ = arcsin x 
3) 
Vl—x- 


Two general formulas of the same nature are consequences of theorems about 
differentiation: 


[iseot sco dx =f fous + f eeods, 


fe-feoas =o: f fond. 


These equations should be interpreted as meaning that a primitive of f + g can 
be obtained by adding a primitive of f to a primitive of g, while a primitive of 
c- f can be obtained by multiplying a primitive of f by c. 

Notice the consequences of these formulas for definite integrals: If f and g are 
continuous, then 


b b b 
i [f@) + g(x)] dx = finds + f 2a(x)dx, 


b 


b 
i, Gem Gi )ax =c- MAX, 


a 

These follow from the previous formulas, since each definite integral may be writ- 
ten as the difference of the values at a and b of a corresponding primitive. Con- 
tinuity is required in order to know that these primitives exist. (Of course, the 
formulas are also true when f and g are merely integrable, but recall how much 
more difficult the proofs are in this case.) 

The product formula for the derivative yields a more interesting theorem, which 
will be written in several different ways. 


If f’ and g’ are continuous, then 


[ fe =F -| Fs 
il fed@\dx = ech i fwewds, 
b b 


PF ee naw. 


a 7a 


b 


POSC ax Sos) eG) 


a 


(Notice that in the second equation f(x)g(x) denotes the function f - 8.) 


PROOF 
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The formula 
Co) mee fat 
can be written 


fe = (fay — f'g. 


[se = fose’- f rs. 


and fg can be chosen as one of the functions denoted by {(fg)’. This proves the 
first formiula. 


Thus 


The second formula is merely a restatement of the first, and the third formula 
follows immediately from either of the first two. 


As the following examples illustrate, tegration by parts 1s useful when the func- 
tion to be integrated can be considered as a product of a function f, whose deriva- 
tive is simpler than f, and another function which is obviously of the form g’. 


fre dx =the = i |. gaad % 
+1 1 + ¢ 
fogs We As 
Se 
[sins as XS cos. x) -{ I -(—cosx)dx 
1 J + 1 J 1 
i 2 fi ie um 8g 


= —xcosx +sinx 


There are two special tricks which often work with integration by parts. ‘The 
first is to consider the function g’ to be the factor 1, which can always be written 


ii i / | -logxdx =x logx — / %-(1/x)dx 


A ae! eel dog tl 
gf ey (eee 


= MOS ie xe 


ill. 


The second trick is to use integration by parts to find { f in terms of fh again, 
and then solve for { h. A stmple example is the calculation 


/ (1/x)-logx dx = log x -logx — / (1/x)-logx dx, 
J J J J J y 
gS f aa 3 Beas 


which implies that 


| 
2 | —log ar = (log x)? 
x 
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or 


l ; 2 
/ —logxdx = Vesey 
Xx 2 


A more complicated calculation is often required: 


[esin x dx =e «(—cosx)'— / e* - (—cosx) dx 


44 J 1 4 1 
is f g Fy g 
= —e"* cosx + / e*cosx dx 
 Y 
uv 
= —e* cosx + [e* - (sinx) — fe (sin x) dx]; 
J Y + 4 
u v ue op 
therefore. 
2 / e* sinx dx = e*(sinx — cosx) 
or 


me e* (sin x — cosx) 
e’snxdx = ————_—_—_——_. 
2 


Since integration by parts depends upon recognizing that a function is of the 
form g’, the more functions you can already integrate, the greater your chances for 
success. It is frequently reasonable to do a preliminary integration before tackling 
the main problem. For example, we can use parts to integrate 


[oe x)?dx = / (log x )(log x) dx 
1 { 
tr 
if we recall that [log x dx = x(logx) — x (this formula was itself derived by inte- 


gration by parts); we have 


/ (log x)(log.x)dx = (log x)|x(log.x) — x] — [ droog) —x|dx 
1 { J { 1 4 
, ££ f g a 8 
= (log x)[x(log x) — x] — [ess — I] dx 
= (log x)[x (log x) — x] — [ve x dx + / ldx 


(log.x)[x(logx) — x] — [x(logx) — x] +x 


i 
= x(logx)*“ — 2x(log x) + 2x. 

The most important method of integration is a consequence of the Chain Rule. 

The use of this method requires considerably more ingenuity than integrating by 

parts, and even the explanation of the method is more difficult. We will therefore 
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develop this method in stages, stating the theorem for definite integrals first, and 
saving the treatment of indefinite integrals for later. 


THEOREM 2 If f and g’ are continuous, then 


(THE SUBSTITUTION FORMULA) e(b) b 
‘| i= il (fog)-g' 
2(a) a 


b 
f(u)du= i Fete) 2 ax. 


g(b) 
g(a) 


PROOF If F is a primitive of f, then the left side is F(g(b)) — F(g(a)). On the other 
hand, 


(Feig) = (F 0%) pa = (7 og) ig, 
so F o g is a primitive of (f o g)- g’ and the right side is 


(F og)(b) — (F og)(a) = F(g(b)) — F(g(a)). 


The simplest uses of the substitution formula depend upon recognizing that a 
given function is of the form (f o g)- g’. For example, the integration of 


b b 
- 5 , 5 
is sir x cos x dx ( = (six) Gos x ar) 
a a 


is facilitated by the appearance of the factor cos.x, which will be the factor g’(x) 
for g(x) = sinx; the remaining expression, (sin x), can be written as (g(x) = 


f(g(x)), for f(u) =u. Thus 


fi 5 os) — sie 
Str X COS. dx 5 
a fu) =i 


b g(b) 
= Wee) ) cay ae fw) du 
a g(a) 
i eer i, sin°b — sin°a 
= urdu = = 
sina 6 6 


The integration of f, tan.x dx can be treated similarly if we write 


” Se sin * 
tanxdx = — —— dx. 
a a COSX 


In this case the factor — sin x is g’(x), where g(x) = cos.x; the remaining factor 
ly/ cosx can then’ be written f(cos x) for f(@) = b/a. Hence 


b a(x) = Coan 
/ tan x dx ] 
a fu) a 


u 


b g(b) 
— | f(g))g'@)dx = 4) f(u) du 
g 


a (a) 


cosb 
= -| —du = log(cosa) — log(cosb). 


[OS a u 
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Finally, to find 


Bye I 
P22 
a Xlogx 


notice that 1/x = g(x) where g(x) = log x, and that 1/log x = f(g(x)) for 
f(u) = 1/u. Thus 


b | g(x) = logx 
—— dx - I 
a Xlogx fw=- 
u 


b g(b) 
= f(gx))g (x)dx = / f(u) du 
g 


a (a) 


logb 
= ih — du = log(log b) — log(log a). 
loga U 
Fortunately, these uses of the substitution formula can be shortened considerably. 
The intermediate steps, which involve writing 


b g(b) 


f(g(x))2'(x) dx = Hl f(u)du, 


2(a) 


a 


can easily be eliminated by noticing the following: ‘To go from the left side to the 
right side, 


, u for g(x) 
substitute ; 
du tor g(x) dx 


(and change the limits of integration); 


the substitutions can be performed directly on the original function (accounting 
for the name of this theorem). For example, 


b Steep Ey sin b 
2» 6 u for sin x 5 
sin> x cos. x dx | substitute k a uo du, 
Is du for cosx dx : 


ina 


and similarly 


> _ sin x ; u for cosx eos. 
—— dx | substitute Z = — du. 
5 osx du for —sinx dx ul 


(OS a 
Usually we abbreviate this method even more, and say simply: 


Wet m= 2) 
du = g(x) dx.” 


b | let ou log Xx log b | 
| ——- dx l — / —du. 
a Xlogx du = —dx | u 


ogra 
Xx > 


‘Thus 


In this chapter we are usually interested in primitives rather than definite in- 
. . ‘Di x . . ~ 
tegrals, but if we can find [, f(«)dx for all a and b, then we can certainly find 


a 
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J f(x)dx. For example, since 


; 6326 
eae sin°b  sin°a 
sin” X cos x:dx = = ; 


6 6 


a 


it follows that 


- 6 
. 5 sin x 
sin’ x cosx dx = 5° 


Sunilarly, 


i tanx dx = —logcos x, 
l 

—— dx = log(log x). 

x log x air 


It is quite uneconomical to obtain primitives from the substitution formula by first 
finding definite integrals. Instead, the two steps can be combined, to yield the 
following procedure: 


u = g(x), 
du = g(x) dx; 


(after this manipulation only the letter u should appear, not the 
letter x). 

(2) Find a primitive (as an expression involving u). 

(3) Substitute g(x) back for wu. 


Thus, to find 


‘AS 
/ sin’ x cos x dx, 


u = sinx, 


(1) let 


du = cosx dx 


iG du: 
6 
u 
fe du= —; 
6 


(3) remember to substitute sin. x back for uw, so that 


v6 
oe sin x 
sin” x cosx dx = ; 

6 


so that we obtain 


(2) evaluate 
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Similarly, if 


u = log x, 
l 
du = — dx, 
x 
then 
l I 
——dx becomes —du = logu, 
x log x u 
so that 


l 
i! xlogx an = log (log Xi): 


x 
——~ dx, 
lm ‘ 


u=1+x", 
du = 2x dx: 


‘To evaluate 


let 


the factor 2 which has just popped up causes no problem—the integral becomes 


Lf l 
>| — du = 5 logu, 


~ ~ 


x I 2 
‘i <i = 5) log(1 2 We 


(This result may be combined with integration by parts to yield 


x 
/ 1 - arctan x dx = x arctan x — / — dv 
1+ x- 


? 
= x arctan x — 5 log(1 + x*), 


a formula that has already been mentioned.) 

These applications of the substitution formula* illustrate the most straight- 
forward and least interestng types—once the suitable factor g’(x) 1s recognized, 
the whole problem may even become simple enough to do mentally. ‘The following 
three problems require only the information provided by the short table of indefi- 
nite integrals at the beginning of the chapter and, of course, the right substitution 


* The substitution formula is often written in the form 
[ fend = [ Fegons'ordr, u = g(x). 


This formula cannot be taken literally (after all, f f(u)du should mean a primitive of f and the 
symbol { f(g(«))g'(x)dx should mean a primitive of (f © g) - 9’; these are certainly not equal). 
However, it may be regarded as a symbolic summary of the procedure which we have developed. If 
we use Leibniz’s notation, and a little fudging, the formula reads particularly well: 
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(the third problem has been disguised a little by some algebraic chicanery). 
if sec” x tan? x dx, 


[cos x)e™* dx, 


x 
/ ee 4 

If you have not succeeded in finding the right substitutions, you should be able to 
guess them from the answers, which are (tan® x)/6, e™*, and arcsin e*. At first you 
may find these problems too hard to do in your head, but at least when g is of the 
very simple form g(x) = ax + 6 you should not have to waste time writing out the 
subsutution. The following integrations should all be clear. (The only worrisome 
detail is the proper positioning of the constant—should the answer to the second be 
e** /3 or 3e*? T always take care of these problems as follows. Clearly pe a 
e*. (something). Now if I differentiate F(x) = e7*, I get Fie) = 3e3*, so the 
“something” must be i, to cancel the 3.) 


| dx ee ea, 
x 


sin 4x 
cos 4x dx = : 


4 
— cos(2x + 1 
[ snes lax ea 
ax __ arctan 2% 
Pe 4r2 © > 


More mteresting uses of the substitution formula occur when the factor g’(x) 
does not appear. ‘There are two main types of substitutions where this happens. 


] x 
/ aw ax: 
| —e* 


The promiment appearance of the expression e* suggests the simplifying substitu- 
Oc é te) 


Consider first 


tion 
H=e", 
du =e“ dx. 


Although the expression e* dx does not appear, it can always be put in: 


l+e* I+e* | 
ea -.-—-e dx. 
1-—e l—e* e 


We therefore obtain 
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which can be evaluated by the algebraic trick 


| | p ] 
i +u ay / 424i —2log(1 — tt) + log u, 
l—u u l—u | 


u 


so that 


] x . > 
ye i ak dx = —2log(1 —e*) + loge* = —2log(1 — e*) + x. 
= e* 


There is an alternative and preferable way of handling this problem, which does 


not require multiplying and dividing by e*. If we write 


Le, x =logu, 
l 
dx = —du, 
u 
then 
+e , : +u 
-dx immediately becomes -—du. 
1—e* l—u u 


Most substitution problems are much easier if one resorts to this trick of express- 
ing x in terms of u, and dx in terms of du, instead of vice versa. It is not hard to 
see why this trick always works (as long as the function expressing uv in terms of x 
is one-one for all x under consideration): If we apply the substitution 


io =p (x), — eg! (uw) 
dx =(g~')'(u) du 


/ f(g(x)) dx, 


(1) [ fens du. 


On the other hand, if we apply the straightforward substitution 


to the integral 


we obtain 


u = g(x) 
du = g'(x)dx 


to the same integral, 
! : 
-g (x)dx, 


[ Feewnas = [ few 
g' (x) 


| 
(2) [fe Se du 
g'(g-*(u)) 


The integrals (1) and (2) are identical, since (g~')'(u) = 1/g'(g~'(u)). 
As another concrete example, consider 


en 
————-. dx. 
| / ex | 
vVeé Ir 


we obtain 
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In this case we will go the whole hog and replace the entire expression ye‘ + | 
by one letter. Thus we choose the substitution 


u=VJVer+l, 


2) 
u“ =e +1, 


? 9 
“uwe—A=e*, x log —1); 
2u 
dx= = du 
u-—]| 
The integral then becomes 
(u2 — 1)? 2u 2us 
——— - ~—— du =2 | u* — 1du = — —-2u. 
u u-— 1 3 
Thus 
2x 
e 


2 > : 
———— dx = ate + 13/7 — 2(e* + 1)!/2, 
Jer = 3 


Another example, which illustrates the second main type of substitution that can 


/ JI — x2 dx. 


In this case, instead of replacing a complicated expression by a simpler one, we 


will replace x by sinu, because ¥ 1 — sin° u = cosu. This really means that we 
are using the substitution uv = arcsin x, but it is the expression for x in terms of u 


occur, is the integral 


which helps us find the expression to be substituted for dx. Thus, 


let x=sinu, [u =arcsin x] 
dx = cosudu; 


then the integral becomes 
a) 2 
¥1l—sin° u cosudu= | cos-udu. 


The evaluation of this integral depends on the equation 


A 1 + cos 2u 
COS: t=: —— 


~ 


(see the discussion of trigonometric functions below) so that 


/ 2d / 1 + cos 2u ; u i sin 2u 
cos-udu =; | ——_——\_“ du = — ‘ 
2 2 4 


and 


2 4 
arcsin x l 


2 2 


arcsin x l 
= ——_ + —xry¥ 1 — x2. 


= arcsinx — sin(2 arcsin x) 
V1 —x*dx = ——_— + ——— 


sin(arcsin x) - cos(arcsin x) 
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Substitution and integration by parts are the only fundamental methods which 
you have to learn; with their aid primitives can be found for a large number of 
functions. Nevertheless, as some of our examples reveal, success often depends 
upon some additional tricks. ‘The most important are listed below. Using these 
you should be able to integrate all the functions in Problems | to LO (a few other 
interesting tricks are explained in some of the remaining problems). 


lL. TRIGONOMETRIC FUNCTIONS 
Since 
03 2 
sin” x + cos" x = 1 


and 


9 < 2 
COS:ZXx =:CcOS: X— sn 
we obtain 


2 2 2 
cos 2x = cos~ x — (1 — cos” x) = 2.cos* x —1, 


sD: ie) Cem J 
cos 2x = (1 —'sin* x) — sin* x = 1 — 2sin* x, 


or 
a4 1 — cos 2x 
sin” x = —————_— 
2 
5 1 + cos 2x 
cos? x = ———_ 
5 


These formulas may be used to integrate 


/ sin” x dx, 
i cos" x dx, 


(1 — cos2x) (1 + cos 2x) 
ESSE ET 9) ee et 


2 2 


if n is even. Substituting 


z ~ 2 4 ‘ ; ‘ ; x * 
for sin“ x or cos~x yields a sum of terms involving lower powers of cos. For 
example, 


| 1 —cos2x\? l | | 
[ sin’ xdx = i (— >=) dx = / qax - 5 | es 2x dx+ i [eo 2x dx 


and ; 4 
~ S /) Xx 
/ cos? 2x dx = i oe dx, 


If n is odd, n = 2k + 1, then 


‘ > ° , 
| sin” x dx = [sin x(1 —cos* x)‘ dx: 
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the latter expression, multiplied out, involves terms of the form sin x cos! xnallaet 
which can be integrated easily. The integral for cos” x is treated similarly. An 
integral 


/ sin” x cos” x dx 
is handled the same way if or m is odd. If n and m are both even, use the 


~ Ce y) 
formulas for sin“ x and cos? x. 
A final important trigonometric integral is 


COSRG 


| 
/ i [sc x dx = log(sec x + tan x). 


Although there are several ways of “deriving” this result, by means of the meth- 
ods already at our disposal (Problem 13), it is simplest to check this formula by 
differentiating the right side, and to memorize it. 


2. REDUCTION FORMULAS 


Integration by parts yields (Problem 21) 


a 1 A l= ] 9 
[ sv" x dx = —==— sin”! x cos + —— | sin” “xdx, 
n 


H 


1 n—| x 
[cos xdx = —cos"—!xsin x + —— | cos” 4xdx; 
n n 


/ ] tin 1 x a 2n — 3 il | , 
G21)" "7 2n=2 G4 Dy! > 2 G2 
and many stmilar formulas. The first two, used repeatedly, give a different method 


for evaluating primitives of sin” or cos”. The third is very important for integrating 
a large general class of functions, which will complete our discussion. 


3. RATIONAL FUNCTIONS 


Consider a rational function p/q where 


Pe) aye es eae, 
q(x) = [ ae De ee Si bo. 


We might as well assume that Qn = Dn = 1. Moreover, we can assume that n < m, 
for otherwise we may express p/g as a polynomial function plus a rational function 
which is of this form by dividing (the calculation 
9) 
ur ] 


= | + —— 
u— | a ey 


is a simple example). The integration of an arbitrary rational function depends 
on two facts; the first follows from the “Fundamental Theorem of Algebra” (see 
Chapter 26, Theorem 2 and Problem 26-3), but the second will not be proved in 
this book. 
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THEOREM 


THEOREM 


Every polynomial function 
7 6s) ee eS Ue A 8 
can be written as a product 
A(X) = 6 yy; ern) (ae Bix ty) -...- Geos Bix +)" 
(where ry +--+ + ry, + 2(5) +--+ + 5)) =m). 
(In this expression, identical factors have been collected together, so that all 


9 Cen 0 
x —a; and x + B)x + yj; may be assumed distinct. Moreover, we assume that each 
quadratic factor cannot be factored further. This means that 


5 
Pe Se (0) 
since otherwise we can factor 


2 Sh eee |e Cutan Ge a 
x° + Ppx+ty; = |x -— | —— Oe ae ea 


2D p 


into linear factors.) 


li « < mM and 


P(x) chs ale ae be pit ys 


q(x) Sy eet kee 4. 4. bb 


= (x — ay) (erg) (x? + Bix ty) +... + Bex ty)", 


then p(x)/q(x) can be written in the form 
p(x) ay Cie, 

=|) Sa te gp FS et 
q(x) (x — a) ce o1)"! 


i [et eee es | 
(o> ay) (Co Or) 
bax +ecy4 _ Dyers Cie, 
(CugiteieG + 1) (x? + Bix + 1)! 
by 1x +e Cy b1,5,X Ale (CH ay 


ee i gonier a or 
(x2 + Bix + 1) (x2 + Bx +)" 


This expression, known as the “partial fraction decomposition” of p(x)/q (x), Is 
so complicated that it is simpler to examine the following example, which illustrates 
such an expression and shows how to find it. According to the theorem, it is 
possible to write 

2x7 + 8x° + 13x? + 2004 + 15x73 + 16x7 + 7x + 10 

@o2 + ee 282 =F Oe + 2) —1)2 
CXL 
(x — 1)? x?4+2y +2 


exth 
(x2 +a + 1)? 


ex tf 
xe txt 


a b 
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To find the numbers a, b, c, d, e, f, g, and h, write the right side as a polynomial 
2 D 2 

over the common denominator (x2 +.x + 1)?(x? + 2x + 3)(x — 1)?; the numerator 

becomes 


a(x — 1)(x? +2x + Wie oe g a [\> -e b(x? +2x + 2)(x? +x+ he 
+ (ex +a — 1)? (x2 ch)? Hhex + AG — 1)°@? 2224 DES wD 
8 (ox 1H) (x Sal)? (eee 


Actually multiplying this out (!) we obtain a polynomial of degree 8, whose coef- 
ficients are combinations of a,...,4. Equating these coefficients with the coeffi- 
cients of 2x7 +8x5+ 13%°+20x7+ 15x34 16x2+7« + 10 (the coefficient of x° is 0) 
we obtain 8 equations in the eight unknowns a,...,/. After heroic calculations 
these can be solved to give 


a= |e 


b 
e=0 fF =0) g=]0. A=). 
‘Thus 


| Del SO 13> 0x =e 17x? FE lGa- 7 ae 7 
a SS at 


ce eee calla 


Hl : +/ a +/ 1 +f diay 
=) f —.— as —__ = (1x = dx —_§____———_ 14 
(x — ine (i — (ee eae Spee ae. : 


(In simpler cases the requisite calculations may actually be feasible. I obtained this 
particular example by starting with the partial fraction decomposition and convert- 
ing it into one fraction.) 

We are already in a position to find each of the integrals appearing in the above 
expression; the calculations will illustrate all the difficulties which arise in integrat- 
ing rational functions. 


The first two integrals are simple: 


! 
/ Te loz, — Ii, 


Nios 
2 —2 
—— dx = 
fe x— | 


The third integration depends on “completing the square” 


x+xtl=(xt+4)? 4+} 


(Uf we had obtained -} lustead of i we could not take the square root, but in this 


case our original quadratic factor could have been factored into lnear factors.) We 
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can now write 


| 1 re 16 ] Ps 
2-9 os 
a + 5 
=}+4+1 
3 
4 
The substitution 
ae t 
“= —_ 
ii 
] 
du = —dx, 
3 
a 
changes this integral to 
6p v2 
== Sana TEL du, 
9 (u+ + 1) 


which can be computed using the third reduction formula given above. 


x +3 
7) Sow dx 
(x* + 2x +2) 


/ xi 3 fas ati dx+ f —— 
a ems : 3 mE 2x (x peat 


The first integral on the right side has been oe constructed so that we can 


Finally, to evaluate 


we write 


evaluate it by using the substitution 


eg ee) 
du = Ox 2) dx 


The second integral on the right, which is just the difference of the other two, 1s 
simply 2 arctan(x + 1). If the original integral were 


x+3 2x +2 p 
==, dx f(s ax+ f as 
(x2 + 2x 4 2)" (x? + 2x +2) [x +1)? +17 


the first integral on the right would still be evaluated by the same substitution. 
The second integral would be evaluated by means of a reduction formula. 

This example has probably convinced you that mtegration of rational functions 
is a theoretical curiosity only, especially since it is necessary to find the factorization 
of q(x) before you can even begin. This is only partly true. We have already seen 
that simple rational functions sometimes arise, as in the integration 


1 my, 
i) ack - AN 
l-e 


another important example ts the mtegral 


| 7 4 | ee 
f=a dx = / = ae x Fi CO — 5 log(x — 1) — 5 log(x + 1). 
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Moreover, if a problem has been reduced to the integration of a rational function, 
it is then certain that an elementary primitive exists, even when the difficulty or 
impossibility of finding the factors of the denominator may preclude writing this 
primitive explicitly. 


PROBLEMS 


1. ‘This problem contains some integrals which require little more than alge- 
braic manipulation, and consequently test your ability to discover algebraic 
tricks, rather than your understanding of the integration processes. Never- 
theless, any one of these tricks might be an important preliminary step in 
an honest integration problem. Moreover, you want to have some feel for 
which integrals are easy, so that you can see when the end of an integration 
process is in sight. ‘The answer section, if you resort to it, will only reveal 
what algebra you should have used. 


; 2D) rn . . 

(v) i! tan* x dx. (Imigonometric integrals are always very touchy, because 
there are so many trigonometric identities that an easy 
problem can easily look hard.) 


faa) i: dx 
i a 
"i Juctasiin x 


oe — +4 
(ix) aan ae 
. x+]1 


l 
(x) Hl (ae ax. 


2. ‘The following integrations involve simple substitutions, most of which you 
should be able to do in your head. 


(1) / e* sine’ dx. 
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(1i) / xe dx. 


AS, log x ; 
(11) ‘ S dx. (In the text this was done by parts.) 


liv ip e* dx 
ex + 2er +10 


(v) / e° e dx. 


(vi) x dx é 
yf 
JX 

(vil) s dx 


(vill) [avi — x2 dx. 
(ix) ih log(cos x) tan x dx. 


i log (log x) 
——_—§_ dx. 


x log x 


3. Integration by parts. 


(1) pee dx. 
(11) bee dx. 


(ui) / e sin bx dx. 


° D 3 
(iv) i x~sin x dx. 


(v) [oe x) dx. 


log (log x 


x 
(vii) sec” x dx. (This is a tricky and important integral that often comes 
up. If you do not succeed in evaluating it, be sure to 


consult the answers.) 


(vill) [ costog.xyae. 
(ix) [ Ve t083 dx. 


(x) [ dog x)? dx. 


ay 
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The following integrations can all be done with substitutions of the form 
x = sinu, x = cosu, etc. ‘To do some of these you will need to remember 
that 


/ sec x dx = log(sec x + tan x) 


as well as the followimg formula, which can also be checked by differentiation: 
/ cse x dx = —log(csc x + cot x). 
In addition, at this point the derivatives of all the trigonometric functions 


should be kept handy. 


(i ax 


. (You already know this integral, but use the substitution 


/ vl ; . 
T—x* y = sinu anyway, just to see how it works out.) 


. ay. oO . 
(Since tan“ u + 1 = sec“ u, you want to use the substi- 


aD : 
VI+x*° tution x = tanu.) 


. (The answer will be a certain inverse function that was 


xv = (Sa 
x“ — 1! given short shrift in the text.) 
at dx 
ey at 


dx 


xV1l+x2 

; se Suey) ee F ; 

(vn) / xv 1 — x' dx. You will need to remember the methods for 
integrating powers of sin and cos. 

(viii) ik V1 —x2dx. 


(ix) [vi + x2dx. 
i: Vx2 — Idx. 


The following integrations involve substitutions of various types. There is 
no substitute for cleverness, but there is a general rule to follow: substitute 
for an expression which appears frequently or prominently; if two different 
troublesome expressions appear, try to express them both in terms of some 
new expression. And don’t forget that it usually helps to express x directly 
in terms of u, to find out the proper expression to substitute for dx. 


; aX 
0) w= 


Gi / dx 
IW lca 
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6. 


(ii) dx 
iil 
Vx t+ Vx 
: dx fee 2 
(iv) ————.. (The substitution u = e* leads to an integral requir- 
| +e* ing yet another substitution; this is all right, but both 


substitutions can be done at once.) 


(v) i af 
Vv ———.. 
2+tanx 


. (Another place where one substitution can be made to 


dx 
«) | 
J/x +1 do the work of two.) 
4*+ 1 
(vil) / a dx, 


2*+ 17° 
(viii) [ee 
: / vl - 
(1x) —— 


x : : ae 
dx. (In this case two successive substitutions work out best; 


[4s 


there are two obvious candidates for the first substitu- 
tion, and either will work.) 


The previous problem provided gratis a haphazard selection of rational func- 
tions to be integrated. Here is a more systematic selection. 


2x7 47x —1 
(i) Se 


xe 4+x2-x-] 
= 2x +1 
il ee ee 
My) / 3st 
Gi ‘i x3 47x? —5x4+5 
il a 
(x — 1)?(x + 1)3 
i / 2? xe 
iv —_—_—_—_—$— dx: 
(x + 3)(x — 1)? 
x+4 
(v) [3 is dx 
(vi / xg? , 
‘j —_—_—___— dx, 
wm x4 4 Dye I 
(vi / 3x743x41 ; 
Vii ——______——_ dx. 
| x3 4+2x24+2x+4+1 
(viii ih dx 
Vill ‘ 
A a ar, 
; 2x 
“1 or. 
3x 
(xX) i a dx 
Jo (x*¢+x4+ 1) 


Sie 


nO: 
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Find , which looks a little different from any of the previous 


dx 
problems. Hint: It helps to write (x” —x7)!/? = x(x""?—1)'/*. Extra Hint 1: 
Use a substitution of the form u? =... to obtain an answer involving arctan. 
Extra Hint 2: Use a substitution of the form y = x® to obtain an answer 
involving arcsin. 


Potpourri. (No holds barred.) The following integrations involve all the 
methods of the previous problems 


: arctan x 

1) ——— dx. 

" / 1+x? 

= xX arctan x 

(11) "i ———+— dx. 
(1+ x-)¢ 

(iii) [v8 V1+x?dx. 

(iv) [> log V¥1+4+x2dx. 


| l 
(v) if —— - ———— dx. 
; x24 ] V | 4+ x4 
(vi) i arcsin /x dx. 


x 
‘il ———- dx. 
(vu) / 1+ sinx oe 


3 . 
ee gay oe COS & Sse 
(vil) ews . —____ dx, 


5 
COS~ X 


(ix) [ Varrax, 


1x ae ; 
(x) i! =o (To factor x® + 1, first factor y* + 1, using Problem 1-1.) 
x 


The following two problems provide still more practice at integration, if you need 
it (and can bear it). Problem 9 involves algebraic and trigonometric manipulations 
and integration by parts, while Problem 10 involves substitutions. (Of course, in 
many cases the resulting integrals will require still further manipulations.) 


9. 


Find the following integrals. 
(1) [rose +x”) dx. 


“A 1+ cosx 
(11) SS dx. 
sn’ x 


br x+1 


(111) ————-. dx. 
V4—x? 


(iv) ‘i x arctan x dx. 
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10. 


1 Ws 


"2; 


ae 
(v) / sin” x dx. 
: sin” Xx 
(v1) —— dx. 
COS~ X 
oe 2) 
(vil) / x~ arctan x dx. 
ee x dx 
(vill) ih ————, 
vx2—2x+2 
(ix) i! sec? x tanx dx. 


(x) / x tan? x dx. 


Find the following integrals. 


‘i i dx 
i yas. 
| (a? + x*)? 


(i1) [ vir sinx ax. 

(iii) J erctan x dx. 

(iv) sin ne | dx. 

(v) [226 
Xx 


There are obvious substitutions to try, 
(vi) log(x + Vx? — 1) dx . 
\ S\- ; *- | but integration by parts is much easier. 
Comparing the answers obtained is, 
(vu) [roe + J/x)dx. perhaps, instructive. 


ams dx 
vill) ——_. 
wa) | x —x3/S 


(ix) [atesin x)? dx. 


re 5 2 

(x) x” arctan(x~) dx. 

If you have done Problem 18-10, the integrals (1i) and (ii) in Problem 4 will 
look very familiar. In general, the substitution x = coshu often works for 
integrals involving vy x? — 1, while x = sinhw is the thing to try for integrals 
. 8 my! r ee r . . . s . ) Re 
involving ¥x-+ 1. Try these substitutions on the other integrals in Prob- 
lem 4. (The method is not really recommended; it is easier to stick with 
trigonometric substitutions.) 


The world’s sneakiest substitution is undoubtedly 


9) 
x=2arctant, dx = ——~<d. 


Pitan 
1+72 


to| = 


*13. 
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As we found in Problem 15-17, this substitution leads to the expressions 
2r l= 
<= —— 
1+? 1+? 
This substitution thus transforms any integral which involves only sin and 


cos, combined by addition, multiplication, and division, into the integral of 
a rational function. Find 


Se — 


d. 
(1) i = (Compare your answer with Problem 1(vii).) 


ss dx a 
(11) il aaa 73 (In this case it is better to let t = tan x. Why?) 
ae Stele es 


ne dx ’ : 
(111) / a (There is also another way to do this, using 
OSI et eS roplem 15-8.) 


(iv) . sin- x dx. (An exercise to convince you that this substitution 
should be used only as a last resort.) 


dx 
(v) ‘ are (A last resort.) 


Derive the formula for f sec.x dx in the following two ways: 
(a) By writing 


] cos Xx 


cosx cos? x 
cos x 


a 
} — sin“ x 


[Pec Oces cos x 
= ee ee 
alee | 
an expression obviously inspired by partial fraction decompositions. Be 
sure to note that ip cos x/(1 —sinx) dx = —log(1 —sin x); the minus sign 
is very important. And remember that 5 log w = log Va. From there 
on, keep doing algebra, and trust to luck. 

(b) By using the substitution f = tanx/2. One again, quite a bit of manip- 
ulation is required to put the answer in the desired form; the expression 
tan.x/2 can be attacked by using Problem 15-9, or both answers can 
be expressed in terms of ¢. There is another expression for f sec x dx, 
which is less cumbersome than log(sec x + tan.x); using Problem 15-9, 
we obtain 


pee z 
tan — 
ye 


Nina Te 
[sco hs ele (tan (= a +), : 
Bs Zz 


] — tan 5 


This last expression was actually the one first discovered, and was due, 
not to any mathematician’s cleverness, but to a curious historical acci- 


388 Derwatives and Integrals 


14. 


[SE 


16. 


7. 


18. 


I 
20. 


Zl 


ata 


*23. 


dent: In 1599 Wright computed nautical tables that amounted to definite 
integrals of sec. When the first tables for the logarithms of tangents were 
produced, the correspondence between the two tables was immediately 
noticed (but remained unexplained until the invention of calculus). 


The derivation of { e* sinx dx given in the text scems to prove that the only 
priniiive softs f (x) =.e%sim x is F(x) = e* (gin x — cos)/2, whereas (4) — 
e*(sinx — cosx)/2+C is also a primitive for any number C. Where does C 
come from? (What is the meaning of the equation 


/ e*sinx dx = e* sinx — e* cosx — / e* sin x dx?) 
Suppose that f” is continuous and that 


ii GIFEy @)\smxvds— 2, 
0 


Given that f(z) = 1, compute f (0). 
(a) Find f aresin x dx, using the same trick that worked for log and arctan. 
*(b) Generalize this trick: Find f f~'(x) dx in terins of f f(x) dx. Compare 
with Problems 12-21 and 14-14. 
; ee 5 ; : ; 
(a) Find f sin” x dx in two different ways: first using the reduction formula, 
5 7 a 

and then using the formula for sin* x. 
(b) Combine your answers to obtain an impressive trigonometric identity. 
Express f log(log x) dx in terms of f{(logx)~! dx. (Neither is expressible in 
terms of elementary functions.) 


Ae 


4 mee eee 6 pees 
Expressifxte dx m terms of (fer *edx: 


Prove that the function f(x) = e*/(e* +e* +1) has an elementary primitive. 
(Do not try to find it!) 


Prove the reduction formulas in the text. For the third one write 


i dx a i dx / x* dx 
Cees 1)" = (x2 oh. ])r-! (x2 zs 1)" 


and work on the last integral. (Another possibility is to use the substitution 
X= tania) 


Find a reduction formula for 


(a) il x"e as 
(b) [coe x) tae 


Prove that 
cosh x 


cosh x sinh x 
Vt? —1dt = ———_——- -— 
I 


No] = 


9) 


— 


(See Problem 18-7 for the significance of this computation.) 
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24. Prove that 
b b 


aaa x — fla+b—x)dx. 


a a 
(A geometric interpretation makes this clear, but it is also a good exercise in 
the handling of limits of integration during a substitution.) 


25. Prove that the area ofa circle of radius r is zr. (Naturally you must remem- 
ber that z is defined as the area of the unit circle.) 


26. Let @ be a nonnegative integrable function such that @(x) = 0 for |x| > 1 


1 
and such that / =e tori = O let 
3] 


] 
bn(x) = mE 


h 


(a) Show that @;(x) = 0 for |x| > h and that Op =, 
—h 
(b) Let f be integrable on [—1, 1] and continuous at 0. Show that 


1 h 
lin f dat = lim | buf = £0. 
h->0* J_] h->0t J_p 
(c) Show that 


; h 
lim =a Uk = 
h->0* f_] he +x- 
The final part of this problem might appear, at first sight, to be an exact 
analogue of part (b), but it actually requires more careful argument. 


(d) Let f be integrable on [—1, 1] and continuous at 0. Show that 


1 
lim / bea yas = 2f(0). 


h—0+ 1 h2 x2 


Hint: If / is small, then h/(h? + x7) will be small on most of [—1, 1]. 
The next two problems use the formula 


ay 5 
5] fe) dé, 
2 a 

derived in Problem 13-24, for the area of a region bounded by the graph of f in 
polar coordinates. 


27. For each of the following functions, find the area bounded by the graphs in 
t=) PI 5 
polar coordinates. (Be careful about the proper range for 6, or you will get 
nonsensical results!) 


i) f(@) =asiné. 

(un) © f(@) =2 + cose. 
(iii) (0)? = 2a* cos 20. 
(iv) f(0) = a'cos 20. 
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FG LORE 


28. 


Figure | shows the graph of f in polar coordinates; the region OAB thus 
ie 2 : 
has area a f(@)° dé. Now suppose that this graph also happens to be 
9% 


the ordinary graph of some function g. ‘Then the region OAB also has area 


x0 
areca AOx,B +f g— area AOXoA. 


x] 


Prove analytically that these two numbers are mdeed the same. Hint: The 
function g is determined by the equations 


Me wae acos 6, ga) — (ee) sire. 


The next four problems use the formulas, derived in Problems 3 and 4 of the 
Appendix to Chapter 13, for the length of'a curve represented parametrically (and, 


in particular, as the graph ofa function in polar coordinates). 


Zo, 


30. 


ole 


Let c be a curve represented parametrically by u« and uv on [a,b], and let h 
be an increasing function with h(@) = a and h(b) = b. Then on [a,b] the 
functions 4 = woh, v = vol give a parametric representation of another 
curve C; mtuitively, ¢ is just the same curve c traversed at a different rate. 


(a) Show, directly from the definition of length, that the length of ¢ on la, b| 
equals the length of ¢ on [a,b]. 

(b) Assuming differentiability of any functions required, show that the 
lengths are equal by using the integral formula for length, and the ap- 
propriate substitution. 


Find the length of the following curves, all described as the graphs of func- 
tions, except for (ii), which is represented parametrically. 


Ge fix= 5 SD) >/2 Oe 


| 
oe 3 
x)=x? + —. |): a Pee 7 
ee eens = ee 
00 ee a> cos? ft, — a sin’ t, =f = 27. 
Ovie=((a) = los(eosx), =x = 706) 


( 
(v) f(a) = loo Ser. 
(vi 


vi) »"f (~)= aresmrc —log2 <x <0. 


For the following functions, find the length of the graph in polar coordinates. 


1) f(@) =a Gos?. 
ry) aco ). 


( 

( 

(i11)"_..7 (6) =a si”(0 /2). 
(iv) 

( 


iv Pe) =e 0.5 27, 


Vv" GO) = ssece OO 2/3. 


32. 


33. 


of. 


0. 
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In Problem 8 of the Appendix to Chapter 12 we described the cycloid, which 
has the parametric representation 


6 — 2, = eee y = u(t) S"a(l — cost): 


(a) Find the length of one arch of the cycloid. [Answer: 8a.] 


(b) Recall that the cycloid is the graph of vo u7!, 


Find the area under 
one arch of the cycloid by using the appropriate substitution in f f and 


. . 2) 
evaluating the resultant integral. [Answer: 3za°.| 


Use induction and integration by parts to generalize Problem 14-10: 


i ce w= f (| cath “fudr) dur) ..-) diy 
0 Te 0 0 0 


If f’ is continuous on [a,b], use mitegration by parts to prove the Riemann- 
Lebesgue Lemma for f: 


b 
lim f@®snQr da= 0. 
A> 0o 
a 
This result is just a special case of Problem 15-26, but it can be used to prove 
the general case (in much the same way that the Riemann-Lebesgue Lemma 
was derived in Problem 15-26 from the special case in which f is a step 
function). 


The Mean Value Theorem for Integrals was introduced in Problem 13-23. 
The “Second Mean Value ‘Theorem for Integrals” states the following, Sup- 
pose that f is integrable on [a,b] and that ¢ is either nondecreasing or 
nonincreasing on [a,b]. Then there is a number & in [a, b] such that 

b 


g 
Hii ae) dx = oa) | f(x)dx + ¢(b) f(x) ae. 
a 5 


b 


a 


In this problem, we will assume that f is continuous and that ¢ 1s differen- 
tiable, with a continuous derivative @’. 


(a) Prove that if the result is true for nonincreasing ¢, then it is also true for 
nondecreasing @. 

(b) Prove that if the result is true for nonincreasing @ satisfying @(b) = 0, 
then it is true for all nonincreasing ¢. 


Thus, we can assume that @ is nonincreasing and $(b) = 0. In this case, 
we have to prove that 
bh 


g 
FCO CST a ia) | FG Vine 


a 
(c) Prove this by using integration by parts 
12) Wg gre nye eb 
(d) Show that the hypothesis that @¢ is either nondecreasing or nonmereasing 
is needed. 
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From this special case of the Second Mean Value ‘Theorem for Integrals, the 


general case could be derived by some approximation arguments, just as in the case 
of the Riemann-Lebesgue Lemma. But there is a more instructive way, outlined 


in the next problem. 


36. 


(a) Given aj,...,@, and bj,..., by, let s, = ay +--+ + ay. Show that 


(EP opp + ieee en — 5] (Dy = 09) 459 (b7 03) 
ae eat Si (Or e1 aa by) al SiO hn 


This disarmimgly simple formula is sometimes called “Abel’s formula for summation 
by parts.” It may be regarded as an analogue for sums of the tegration by parts formula 


b b 
f' (x)g(x) dx = ftb)g(b) — fla)g(a) — f(x)g’(x) dx, 


a a 
especially if we use Riemann sums (Chapter 13, Appendix). In fact, for a partition 
P = {to,...,tn} of [a, b], the left side is approximately 


n 
(1) Yo Fath = tk-1), 
k=l 
while the right side is approximately 


at 


f(b) g(b) — fla)g(a) — Do f(t! (te) (tk — th—1) 


k=] 
which is approximately 
NH 
; g(%) — g(tp—1) 
f(b)g(b) — Flaygta) — J fi) SE BEY, — 1) 
| ik — tk-] 
iH 
= f(b)g(b) — fla)gla) + ‘3 f (ty) [9 (te) — 2(ty)] 
k=] 
n 
= f(b)g(b) — fla)g(a) + SY LF) — f(a)]- [g(t,_-1) — g(ty)] 
k= 


n 
+ fla) Y 7 gta) — Blt). 
k=1 


Since the right-most sum is just g(a) — g(b), this works out to be 


n 
(2) [ f(b) — flag) + DIL Fe) — fa) [¢u—1) - 84), 
=) 
If we choose 
a. = f (ty) (ty = (eid br = B(tk—-}) 


then 
n 


(1) is arbe, 


=| 
which is the left side of (*), while 
k k 
= Sy f p(y —tj-)) iS approximately ys f(t) — fj—-1) = Fy) — f(a), 
7 al 


SO 
Hn 
(2) is approximately Snbn + » sp (bp — bg—), 
a 


which is the right side of (*). 
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This discussion is not meant to suggest that Abel’s formula can actually be derived trom 
the formula for integration by parts, or vie versa. But, as we shall see, Abel’s formula can 
often be used as a substitute for integration by parts in situations where the functions in 
question aren't differentiable. 


(b) Suppose that {b,} is nonincreasing, with b, > 0 for each n, and that 
m<a;+---+a, <M 
for all n. Prove Abel’s Lemma: 
bim < ayby +--+ + aybn < bi) M. 


(And, moreover, 
bem < apby + +--+ a,b, < byM, 


a formula which only looks more general, but really isn’t.) 

(c) Let f be integrable on [a,b] and let ¢ be nonincreasing on [a, b] with 
p(b) = 0. Let P = {to,...,t} be a partition of [a,b]. Show that the 
sum 


So fb (1s = 4-1) 


i=] 


lies between the smallest and the largest of the sums 


k 
pla) x PF G-G = hy): 


i=l 


Conclude that 
) 


f(x)o(x) dx 


a 


lies between the minimum and the maximum of 


oa) | f(t) dt, 


a 
E 
and that it therefore equals ota) | f(t) dt for some & in [a, bd]. 
a 


37. (a) Show that the following improper integrals both converge. 
(i) i sin (s + | dx. 
0 x 
Pan (ee 
(11) sin” | x + —:} dx. 
0 x 


(b) Decide which of the following improper integrals converge. 


eS | 
(1) / sin (-) dx. 
x 
Sc eee | 
(11) / sin (-) dx. 
JI x, 
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38. 


39) 


*40. 


*41, 


Il 
(a) Compute the (improper) integral [ log x dx. 
0 


1 
(b) Show that the improper integral / log(sin.x) dx converges. 
0 


(c) Use the substitution x = 2u to show that 


4 m/2 n/2 
/ log(sinx)dx = 2 | log(sin.x) dx + om log(cos.x)dx + log 2. 
0 0 0 


n/2 
(d) Compute / log(cos.x) dx. 
0 


us 
(ec) Using the relation cos x = sin(a/2 — x), compute / log(sin.x) dx. 
0 


Prove the following version of integration by parts for improper integrals: 
CO 


i u'(x)v(x)dx = u(x)v(x) 


a 


— i ree xs 


a 


The first symbol on the right side means, of course, 


hm u(x)v(x) — u(a)v(a). 


One of the most important functions in analysis is the gamma function, 


re 
ro} gull 
0 


(a) Prove that the improper integral (x) is defined if x > 0. 
(b) Use integration by parts (more precisely, the improper integral version 
in the previous problem) to prove that 


ital) =r ee 


(c) Show that [(1) = 1, and conclude that ['(#) = (a — 1)! for all natural 
numbers 7. 


The gamma function thus provides a simple example ofa continuous function 
which “interpolates” the values of n! for natural numbers #. Of course there 
are infinitely many continuous functions f with f(7) = (a — 1)!; there are 
even infinitely many continuous functions f with f(x + 1) = xf (x) for all 
x > 0. However, the gamma function has the important additional property 
that log © P is convex, a condition which expresses the extreme smoothness 
of this function. A beautiful theorem due to Harold Bohr and Johannes 
Mollerup states that T° is the only function f with logo f convex, f(1) = | 
and f(« + 1) =xf(x). See reference [43] of the Suggested Reading. 


(a) Use the reduction formula for f sin” x dx to show that 


4 


5) 
ae ne ON eae ee 
hy 7 Sa san Xx eax. 
J0) H 0 
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(b) Now show that 


ye Be 2 4 6 2n 
if sin x dx sige ge et Pana 
[ sntsar =F 338 sp ciate 2n — | 
0 ees Ie ee. 6G an 
and conclude that 
Be 2n 
7 OD Ae 6 On an if sin~ x dx 
2°43 535 5 al Bel pF 
i sin xdx 


(c) Show that the quotient of the two integrals in this expression is between 
1 and | + 1/2n, starting with the inequalities 


2n+1 | 


= LD, ee pe 
0) < sin x<sm~ x ssn x forlOu=<x < 2/2: 


This result, which shows that the products 


2 2 & 4 i& © 2n 2n 


ress 5S 7 Qa) Qn 
can be made as close to 2/2 as desired, is usually written as an infinite 
product, known as Wallis’ product: 


ee ee ee ee 


(d) Show also that the products 


1 2+4-6-...+ 2n 
J/n 1-3-5-...-(Q2n —1) 
can be made as close to ./7 as desired. (This fact is used in the next 
problem and in Problem 27-19.) 
Wallis’ procedure was quite different! He worked with the integral fia — x7)" dx 


(which appears in Problem 42), hoping to recover, from the values obtained for natural 
numbers n, a formula for 


A complete account can be found in reference [49] of the Suggested Reading, but the 
following summary gives the basic idea. Wallis first obtained the formula 


u 5 2 4 2n 
(1 —x-Y ar = : 
I 5 


‘S  Wat 
(24 -+.2n)? 2” (n!)? 
~ 2.3.4--.2nQn+1) ~~ 2n+1 Qa) 


He then reasoned that 2/4 should be 


! 2! (4) 
Dy NE WY 
ih 3") F a= a ai = (5!) 
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If we interpret 5! to mean F(1 + 5), this agrees with Problem 45, but Wallis did not 
know of the gamma function (which was invented by Euler, guided principally by Wallis’ 


a; 
work). Since (2n)!/(n!)? is the binomial coefficient (, ‘h Wallis hoped to find 5! by 
n 
finding Ci a ‘) for p =g = |/2.-Now 
P 


pt+@q oe a) ahd = (ae WD) 
P q! 


and this makes sense even if p is not a natural number. Wallis therefore decided that 


Z+q\ 5 or )-:: (3) 
7 q! 


= nok : Mate ae 
With this interpretation of i. ') for p = 1/2, it is still true that 
» gf 


oe pages) 
p gti pos 


~ NI— 


] 
: 5 +4 : : : 
Denoting (? \ ) by W(q) this equation can be written 
3) 
| 
x7 +qt+1 2q+3 
W(g + 1) = ———— W(g) = ——~ W(q), 
Wee +1 a) 2g +2 (9) 
which leads to the table 
q ! 2 3 


"ey, Se ree 


2 
tol— 
bolw 
pin 


Next Wallis notes that if a,, a2, a3, ag are 4 successive values W(q), W(q + 1), W(q +2), 
W(q + 3), appearing in either of these tables, then 

a Q a4 oe 2q+3 2qg+5 2q¢+7 

JS SS fice == > +S, Sa 

aq a @ 2q+2 2q¢+4 2q¢+6 


(this says that logo(1/W) is convex, compare the remarks before Problem 41), which 


unphes that 
a3 a3 a4 
—>—> /—. 
ay a ay 


Wallis then argues that this should still be true when a1, a2, a3, aq are four successive 

values in a combined table where q is given both mteger and half-integer values! ‘Thus, 
: : 3 : 

taking as the four successive values W (na + I Win), Win + 5), Win + 1), he obtains 


4 4 6 
7 Me 
2PES ae 
spa 


which yields simply 


2n+4 4 DP ahettha (§ 0 (ono (QM Qn ss 2 Qn a3 
Se fe | ss | 
V2n+3 2 Tn Dy se D 


from which Wallis’ product follows mmmediately. 


ie Oa 


LA IE 
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oO 
It is an astonishing fact that improper integrals / f(x) dx can often be 
0 


b 
computed in cases where ordinary integrals f(x) dx cannot. There is no 


a 
CO 


b 
22. ee 
elementary formula for , e * dx, but we can find the value of i ey dx 
0 


a 
precisely! There are many ways of evaluating this integral, but most require 
some advanced techniques; the following method involves a fair amount of 
work, but no facts that you do not already know. 


(a) Show that 


[iu Dine ok 2n 

3 La bo 5 ans 

if ] XN 3 2 
—— iii i 

9 U+x2)" pmo A Dy = 9 


(This can be done using reduction formulas, or by appropriate substitu- 
tions, combined with the previous problem.) 


(b) Prove, using the derivative, that 


be eee for 0 == I 
5 l 
—— Or 
l+x- 


(c) Integrate the nth powers of these inequalities from 0 to | and from 0 to 
oo, respectively. Then use the substitution y = /n x to show that 


me 4 2n 
Noe se... 
I 2n+ 1 
vn, OS 
</ ap dy = | @- dy 
0 0 
A 13 2n—3 
<< = W— + meee 
oe eee 7) ay, 


(d) Now use Problem 41(d) to show that 


(a) Use mtegration by parts to show that 


> sin x cosa cosh > cosx 
Xa — — — be 
a a b tee 5 


OG C = ead . i 
and conclude that fy (sin.x)/x dx exists. (Use the left side to investigate 
the limit as @ > OF and the right side for the limit as b > oo.) 
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(b) Use Problem 15-33 to show that 


[ sin( + ly 
———<——dt=n 
0 ! 


sin = 
ao 
for any natural number n. 
(c) Prove that 
; I 2 l 
lim sin(A + 5)¢ | — — —— | dr =0. 

A> J) af t ~ of 
sin = 
ais 


Hint: The term in brackets is bounded by Problem 15-2(vi):; the 
Riemann-Lebesgue Lemma then applies. 
(d) Use the substitution u = (A + ly, and part (b) to show that 


eo Sinx: 
dx= 
0 vo 


22) 
44. Given the value of i) (sin x)/x dx from Problem 43, compute 
0 


| (=) m 
0 x 


by using integration by parts. (As in Problem 38, the formula for sin 2x will 


m| a 


play an important role.) 


*45. (a) Use the substitution u = 1f* to show that 


: Lf cea 
C(x) = - e du. 
DAN H's) 


(b) Find P(5). 


Grae : : 
is integrable on every interval [a,b] for 0 < a < b, 


*46. (a) Suppose that 
and that lim F(x) = A and tim f(x) = B. Prove that for all a, B > 0 
we have 

fF flax) — f (Bx) eT ee p 
JO x a 
"Nf (ax) — f (Bx) 


Xx 


dx use two different substitutions. 


Hint: ‘To estimate | 


f(x) 


x 


dx converges for all a > 0 and that 


x 
(b) Now suppose instead that / : 
/a 


lim f(x) = A. Prove that 
r—0 


SD 


lis flax) — f(Bx) 
. dx 
( 


x = Alog —. 
J0 A 


2 
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(c) Compute the following integrals: 


00 pax __ pe px 
(1) | aa nn 
0 x 


Gi) [ —cos(ax) — cos(Bx) Pe 
0 


X 


In Chapter 13 we said, rather blithely, that integrals may be computed to any 
degree of accuracy desired by calculating lower and upper sums. But an applied 
mathematician, who really has to do the calculation, rather than just talking about 
doing it, may not be overjoyed at the prospect of computing lower sums to evaluate 
an integral to three decimal places, say (a degree of accuracy that might easily be 
needed in certain circumstances). The next three problems show how more refined 
methods can make the calculations much more efficient. 


We ought to mention at the outset that computing upper and lower sums might 
not even be practical, since it might not be possible to compute the quantities mm; 


and M; for each interval [4)-1, 4]. It is far more reasonable simply to pick points .; 
Hn 


in [/;-1, 4] and consider f Qi) + G — 1). This represents the sum of the areas 


rl 
of certain rectangles which partially overlap the graph of f—see Figure | in the 


Appendix to Chapter 13. But we will get a much better result if we instead choose 
the trapezoids shown in Figure 2. 


ET Giese 


Suppose, in particular, that we divide [a,b] into n equal intervals, by means of 


the points 
_[b-a : 
f=a +! =artih. 


n 
Then the trapezoid with base [4,-1.4] has area 


ee 


= i — hea) 
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and the sum of all these areas is simply 


By =| OE LO, ALO yg OEM) 


2 2 2 
: fay 425 f +ih) + f(b) = —! 

=z a LZ a Til 2 . i= c 
2 = : H 


This method of approximating an integral is called the ¢rapezoid rule. Notice that to 
obtain X, from &,, it isn’t necessary to recompute the old f(¢;); their contribution 
to ZX, 1s just 5 Zn. NO an practice it is best to compute U3, H4, Ug, ... to get 
b b 
approximations to ff. In the next problem we will esumate { f — Zp. 
a a 
47. (a) Suppose that f” is continuous. Let P; be the linear function which agrees 
with f at 4-1 and ¢;. Using Problem 11-46, show that if 2; and N; are 
the minimum and maximum of f” on [f_1, f;] and 


1, 
p= fener =nyas 
t_] 


t 


then 


1,1 = f pate N,l 
7 = Cl ee) Pa a | 
t,-] 


t 


(b) Evaluate / to get 


n,h> ft N;h? 
= > Cf = Ep) 24 
t-] 


12 Ilys 


(c) Conclude that there is some c m (a,b) with 


b 3 
: (a 
= an a. SpA yaa 

f 12n2 


i} 1 
Le: 
a 
: . 26 >) ‘ B) c 
Notice that the “error term” (b — a)? f’(c)/12n? varies as 1/1? (while 
the error obtained using ordinary sums varies as 1/1). 


We can obtain still more accurate results if we approximate f by quadratic 
functions rather than by linear functions. We first consider what happens when 


FIGURE 3 the mterval [a, b] is divided into two equal intervals (Figure 3). 


48. (a) Suppose first that a = 0 and b = 2. Let P be the polynomial function 
of degree < 2 which agrees with f at 0, 1, and 2 (Problem 3-6). Show 
that 


9) 


. ] 
ih P= 31/0) +4f() + FO). 
QO ~ 


(b) Conclude that m the general case 


b p= ] 
i P= A flay+4f Bot -) + f(b)}. 
a 6 2 
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b b 
(c) Naturally a f when f is a quadratic polynomial. But, re- 


a a 
markably enough, this same relation holds when / is a cubic polynomial! 
Prove this, using Problem 11-46; note that f 


wt 


is a constant. 


The previous problem shows that we do not have to do any new calculations 


b 

to compute i! Q when Q 1s a cubic polynomial which agrees with f at a, b, and 
a 

at+b 

: we still have 


p b-al. a+b 
9 ==" | ray +4r( 5 ) +o]. 


But there is much more lee-way in choosing Q, which we can use to our advantage: 


49. (a) Show that there is a cubic polynomial function Q satisfying 


Fe 8 ; 
O(a) = f(a), O(b) = f(b). o(% i (“4 ) 


2 


- 


o(S j=r(S). 


Hint: Clearly OG) = PO) Al — 9) Ga) (x = 


a+b 


) for some A. 
(b) Prove that if f is defined on [a, b], then for every x in [a,b] we have 
or al) 
x — b)——— 

2 4! 


for some € in (a,b). Hint: Imitate the proof of Problem 11-46. 
(c) Conclude that if f is continuous, then 


b b—a a+b (b-—a) 44) 
; f= 72" 1 ra+as ( 5 ) +10] ses) 


for some c in (a, D). 


F (iw Opts ex —m) (: = 


(d) Now divide [a, b}] into 27 intervals by means of the points 


p b—a 
tt =actih, = : 
2n 


- 


Prove Sempson’s rule: 


n—| 


eee Only a . - 
i r= PE ( re e4y sno 42E sea) + 70) 


‘il 
(b — a) 
2880n4 


for some ¢ in (a, bd). 
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FIGURE 3 


APPENDIX. THE COSMOPOLITAN INTEGRAL 


We originally introduced integrals in order to find the area under the graph of 
a function, but the integral is considerably more versatile than that. For example, 
Problem 13-24 used the integral to express the area of a region of quite another 
sort. Moreover, Problem 13-25 showed that the integral can also be used to ex- 
press the lengths of curves—though, as we've seen in Appendix to Chapter 13, a 
lot of work may be necessary to consider the general case! This result was prob- 
ably a little more surprising, since the integral seems, at first blush, to be a very 
two-dimensional creature. Actually, the integral makes its appearance in quite a 
few geometric formulas, which we will present in this Appendix. ‘To derive these 
formulas we will assume some results from elementary geometry (and allow a little 
fudging). 

Instead of going down to one-dimensional objects, we'll begin by tackling some 
three-dimensional ones. ‘Vhere are some very special solids whose volumes can 
be expressed by integrals. ‘he simplest such solid V is a “solid of revolution,” 
obtained by revolving the region under the graph of f > 0 on [a,b] around 
the horizontal axis, when we regard the plane as situated in space (Figure 1). 
If P = {t,..-,¢,} is any partition of [a,b], and m; and M; have their usual 
meanings, then 


, 
em; (ti) — t;—)) 


is the volume of a dise that lies inside the solid V (Figure 2). Similarly, 
aM UG) tj) is the volume of a disc that contains the part of V between f-| 
and ¢;. Consequently, 


i it 
9) 9) 
Tt Yo mir (t —t;-1) < volume V <z So MiP (ti —f;-1). 


il f—Il 


But the sums on the ends of this inequality are just the lower and upper sums for 


f? on [a,b]: 
1: mr Pi) = volume VY = 7 - Ge P). 


Consequently, the volume of V must be given by 


b 
volune V =z fxyoan. 


a 
This method of finding volunies is affectionately referred to as the “disc method.” 
Figure 3 shows a more complicated solid V obtained by revolving the region 
under the graph of f around the vertical axis (V is the solid left over when we start 
with the big cylinder of radius 6 and take away both the small cylinder of radius a 
and the solid V; sitting right on top of it). In this case we assume a > 0 as well 


FIGURE 6 


FIGURE 7 


19, Appendix. The Cosmopolitan Integral 403 


as f > 0. Figures 4 and 5 indicate some other possible shapes for V. 
f= g | | 


FIGURE 4 


JEG WORT, 3) 


bor a partition P = {t9, ..., tn} we consider the “shells” obtained by rotating the 
rectangle with base [f;-1,t;] and height m, or M; (Figure 6). Adding the volumes 
of these shells we obtain 
u n 
ris dit a Pai) < volume V <z Ne Mie eae = poe 
il tll 
which we can write as 
n 


a So milti T 4-4) — Bes volume V coe yo M (iF 1), ae). 


i=] i=] 


— 


Now these sums are not lower or upper sums of anything. But Problem | of the 
Appendix to Chapter 13 shows that each sum 


HW il 


So miti(t; teal) and So mitt — ti-1) 


b 

can be made as close as desired to / xf(x)dx by choosing the lengths ¢; — 4-1 
a 

small enough. ‘The same is true of the sums on the right, so we find that 


b 
volume V = an f RCO 
a 
this is the so-called “shell method” of finding volumes. 

The surface area of certam curved regions can also be expressed in terms of inte- 
erals. Before we tackle complicated regions, a httle review of elementary geometric 
formulas may be appreciated here. 

Figure 7 shows a right pyramid made up of triangles with bases of length / and 
altitude s. ‘Vhe total surface area of the sides of the pyramid is thus 


, 
as : 


where p is the perimeter of the base. By choosing the base to be a regular polygon 
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FIGURE 8 


(b) 


PAG CURSES 9) 


FIGURE 10 


with a large number of sides we sec that the area ofa right circular cone (Figure 8) 
must be 


—(2nr)s =a7rs, 
a )s 


where s is the “slant height.” Finally, consider the frustum of a cone with slant 
height s and radu ry and rz shown in Figure 9(a). Completing this to a cone, as 
in Figure 9(b), we have 


Sy} fae os 
ry ry : 
sO 
rys ros 
s) = ——., sp+s = 
n-r ee | 
Consequently, the surface area is 
2 
(oe SAE) 
wro(s| +s) —arys; = As——— = As(r, +1). 
mr 


Now consider the surface formed by revolving the graph of f around the hori- 
zontal axis. For a partition P = {fo,....t,} we can inscribe a series of frustums of 
cones, as in Figure 10. ‘The total surface area of these frustums 1s 


ry [fH + fly Gi — 4-v? + [fGi) — fev)? 


Teall 


=") [fi + fi] ae ) (t; — ti-1). 


ill 


By the Mean Value Theorem, this is 


nS [FG + fi) ]V1 + f' Qi)? i = G1) 
i=l 
for some x; in (f;-1,4;). Appealing to Problem 1 of the Appendix to Chapter 13, 


we conclude that the surface area 1s 


b 
an f eo lean ee a. 


PROBLEMS 


1. (a) Find the volume of the solid obtained by revolving the region bounded 
9) . . 
by the graphs of f(x) =x and f(x) = x* around the horizontal axis. 
i g 
(b) Find the volume of the solid obtained by revolving this same region 


around the vertical axis. 


2. Find the volunie of a sphere of radius r. 
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‘ x on E é 3 ‘ > > 9 > é 

3. When the ellipse consisting of all points (x, y) with x-/a> + y~/b- = | is 
rotated around the horizontal axis we obtain an “ellipsoid of revolution” 
(Figure 11). Find the volume of the enclosed solid. 


FIGURE 11 


4. Find the volume of the “torus” (Figure 12), obtained by rotating the circle 
>) a) oy, : : ; 
(x — a)- + y- = b- (a > b) around the vertical axis. 


length b 


eS 


FIGURE 12 


5. A cylindrical hole of radius a is bored through the center of a sphere of 
radius 2a (Figure 13). Find the volume of the remaining solid. 


FIGURE 13 
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6. (a) For the solid shown in Figure 14, find the volume by the shell method. 


FIGURE 14 


(b) ‘This volume can also be evaluated by the disc method. Write down 
the integral which must be evaluated in this case; notice that it is more 
complicated. ‘The next problem takes up a question which this might 


suggest. 


7. Figure 15 shows a cylinder of height b and radius f(b), divided into three 
solids, one of which, V,, is a cylinder of height a and radius f(a). If f 
is one-one, then a comparison of the disk method and the shell method of 
computing volumes leads us to believe that 


b 
abf(b) = maf (a) —7 f(x)? dx = volume V> 


‘ f(b) 
an f yf '(y)dy. 


fla) 


Prove this analytically, using the formula for / f~! from Problem 19-16, or 


more simply by going through the steps by which this formula was derived. 


f(b) 


FIGURE 15 


FIGURE 16 


10. 


11. 
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(a) Figure 16 shows a solid with a circular base of radius a. Each plane 
perpendicular to the diameter AB intersects the sold in a square. Using 
arguments similar to those already used in this Appendix, express the 
volume of the solid as an integral, and evaluate it. 

(b) Same problem if each plane intersects the solid in an equilateral triangle. 


Find the volume of a pyramid (Figure 17) in terms of its height / and the 
area A of its base. 


mY, 


GRO ig, 


Find the volume of the solid which is the intersection of the two cylinders 
in Figure 18. Hint: Find the intersection of this solid with each horizontal 


Ly 


plane. 


RGU ES 


. . ~ . . oy) 

(a) Prove that the surface area of a sphere of radius r is 4zrr-. 
(b) Prove, more generally, that the area of the portion of the sphere shown 
in Figure 19 is 2arh. (Notice that this depends only on h, not on the 


position of the planes.) 


WUTOA ETS IE, TS) 
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12. 


13. 


(c) A circular mud puddle can just be covered by a parallel collection of 
boards of length at least the radius of the circle, as in Figure 20 (a). Prove 
that it cannot be covered by the same boards if they are arranged in any 
non-parallel configuration, as m (b). 


— 


(a) (b) 


PLG URE 20 


(a) Find the surface area of the ellipsoid of revolution in Problem 19-3. 
(b) Find the surface area of the torus in Problem 19-4. 


The graph of f(x) = 1/x, x = 1 1s revolved around the horizontal axis 
Figure 21). 


( 

(a) Find the volume of the enclosed “infinite trumpet.” 

(b) Show that the surface area 1s mfinite. 

(c) Suppose that we fill up the tumpet with the finite amount of pamt found 
in part (a). It would seem that we have thereby coated the infinite inside 
surface area with only a finite amount of paint. How is this possible? 


FIGURE 21 


PART fi 


INFINITE 
SEQUENCES 
AND 
INFINITE 
SERIES 


One of the most remarkable series of 
algebraic analysis is the following: 


pit gee eee 


1 P-2 
m(m — 1)(m — 2) 
hes 
m(m — 1)--- [m—(n— ])] 


A i eke G 


+ x" 


When m ts a positive whole number 

the sum of the series, 

which ts then finite, can be expressed, 

as is known, by (1 + x)". 

When m is not an integer, 

the series goes on to infinity, and it will 
converge or diverge according 

as the quantities 

mand x have this or that value. 

In this case, one writes the same equality 


(Itxn=1+—x 


m(m — 1) 
1-2 
.. . It ts assumed that 
the numerical equality will always occur 
whenever the sertes 1s convergent, but 
this has never yet been proved. 


x2+--- ele. 


NIELS HENRIK ABEL 


CHAPTER 


APPROXIMATION BY 
POLYNOMIAL FUNCTIONS 


There is one sense in which the “elementary functions” are not elementary at all. 
If p is a polynomial function, 


P(X) = a9 + ayx +--+ + ayx", 


then p(x) can be computed easily for any number x. This is not at all true for 
functions like sin, log, or exp. At present, to find logx = fy l/tat approximately, 
we must compute some upper or lower sums, and make certain that the error 
involved in accepting such a sum for logx is not too great. Computing e* = 
log~'(x) would be even more difficult: we would have to compute log a for many 
values of a until we found a number a such that log a is approximately x—then a 
would be approximately e’. 

In this chapter we will obtain important theoretical results which reduce the 
computation of f(x), for many functions f, to the evaluation of polynomial func- 
tions. ‘The method depends on finding polynomial functions which are close ap- 
proximations to f. In order to guess a polynomial which is appropriate, it is useful 
to first examine polynomial functions themselves more thoroughly. 

Suppose that 

P(X) = ag + ayxX +++ + ay x", 


It is interesting, and for our purposes very important, to note that the coefficients a; 
can be expressed in terms of the value of p and its various derivatives at 0. To 
begin with, note that 


p(O) = ag. 
Differentiating the original expression for p(x) yields 
Piya 4+200x > nan”. 


Therefore, 
pa Oe play = ay. 


Differentiating again we obtain 
p’ (x) = 2a2 +3 -2-a3x +---+n(n—1)-a,x"~?2. 
Therefore, 
” (2) 
p (0) = p“’(0) = 2a. 

In general, we will have 
p® (0) 

kl 
If we agree to define 0! = 1, and recall the notation p = p, then this formula 
holds for k = O also. 


p (0) =kla, or a= 


41] 
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Ifwe had begun with a function p that was written as a “polynomial in (x —a),” 
P(x) =ag tax —a) +--+ +a,(x —a)", 
then a smular argument would show that 


p(a) 
a. = 
k! 


Suppose now that f is a function (not necessarily a polynomial) such that 


F2G).. 7 @ 


all exist. Let 


and define 
Praia = dag + a(x Hil ee tect CEN Oe, 


The polynomial P,.¢ is called the Taylor polynomial of degree n for f at a. 
(Strictly speaking, we should use an even more complicated expression, like Pya, f, 
to indicate the dependence on f; at times this more precise notation will be useful.) 
The ‘Taylor polynonual has been defined so that 


Ban (a) aT (a) tor 0 = kh 


in fact, it is clearly the only polynomial of degree <n with this property. 

Although the coefficients of Py.a,¢ seem to depend upon f in a fairly comph- 
cated way, the most important elementary functions have extremely simple ‘Taylor 
polynomials. Consider first the function sin. We have 


sin(O) = 0, 
sin’(O) = cosO = 1, 
sin’(O) = —sin0 = 0, 
sin” (0) = —cos0 = —1, 
sin (0) = sin0 = 0. 


From this point on, the derivatives repeat in a cycle of 4. ‘Vhe numbers 


sin” (QO) 
i 
Are 
| | | | 
00 12. 0; — 3 0, ar" 0, 3 or 
Therefore the ‘Faylor polynomial P2,,,1,9 of degree 21 + 1 for sin at 0 ts 
P. =e, an 4 al 
eS BT a Qn+ DV 


(Of courst, Wo, = Font? 0): 
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The ‘Taylor polynomial P2,9 of degree 2n for cos at 0 is (the computations are 
left to you) 


Bi 4 6 2n 
: ee het Pee = 
Pan Q(X) = By 7 4! 6! * + ( (2n)! ; 


The ‘Taylor polynomial for exp is especially easy to compute. Since exp“ (0) = 
exp(O) = 1 for all k, the Taylor polynomial of degree n at 0 is 


A 
y x4 x? 


baum - 
20a ae 

The ‘Taylor polynomial for log must be computed at some point a ¥ 0, since 
log 1s not even defined at 0. The standard choice is a = 1. Then 


Prot) => 


log’ (x) = -, loc (Dyer 

log’(x) = — = log’(1) = -1; 
ss 

log” (x) = - log (ly — 2. 


m general 


SN ed: lbh 
iovaeeg le 41) =) ee = 


log (x) = : 
X 


Therefore the ‘Taylor polynomial of degree n for log at | 1s 


we he x-13 Sy ee aye 
fe nn, oe een 
2 3 n 


It 1s often more convenient to consider the function f(x) = log(| +x). In this 
case we can choose a = 0. We have 
f@) slog (1 + x), 
SO 


fO) Sloe) =)! ape 


Therefore the Taylor polynomial of degree n for f at 0 is 


a x2 n x? xt b 2 (—1)"7!x" 
f x) S37 =] ee bactr es 
uae tom A n 


There is one other elementary function whose ‘Taylor polynomial ts mportant 
arctan. ‘The computations of the derivatives begin 


arctan (x) = arctan’(0) = |; 


k m2 
ye 
(1 + x2)?’ 


tan’”(x) (1 +.x7)? - (—2) + 2x - 2(1 +17) - 2x 
71 OO RR 6 
=" CF at2 ys 


oa 


arctan’ (x) = arctan” (0) = 0: 


i) 


arctan” (0) = —2. 
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It is clear that this brute force computation will never do. However, the Taylor 
polynomials of arctan will be easy to find after we have exanined the properties 
of ‘Taylor polynomials more closely——although the Taylor polynomial Py.¢,¢ was 
simply defined so as to have the same first 7 derivatives at a as f, the connection 
between f and Py.a,¢ will actually turn out to be much deeper. 

One line of evidence for a closer connection between f and the ‘Taylor polyno- 
mials for f may be uncovered by examining the ‘Taylor polynomial of degree 1, 
which is 


Py g(x) = fla) + flax — a). 


Notice that 
of hex) = Py a(x) = (aCe) =a fa) 


PSs &! 


— f(a). 


~— O 
Now, by the definition of f’(a) we have 


(x) =a Vial O38) a 


XxX =a 


0. 


lim 


xa 


2! 


j Ne 
ez) ate 


Eep(x)e= 1 +x 


PUG URES 


In other words, as x approaches a the difference f(x) — Pj,q(x) not only becomes 
small, but actually becomes small even compared to x — a. Figure 1 illustrates the 
graph of f(x) =e* and of 


Pj oy = f O).+ f Ox =1+4 x, 


which is the Taylor polynomial of degree | for f at 0. ‘The diagram also shows 
the graph of 


nig (O) ) ; x 


Po.(x)'= f(0)+ f'O) + ——x 


which is the ‘Taylor polynomial of degree 2 for f at 0. As x approaches 0, the 
difference f(x) — P2.9(x) seems to be getting small even faster than the difference 
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f(x) — Pio(x). As it stands, this assertion is not very precise, but we are now 
prepared to give it a definite meaning. We have just noted that in general 


li LX) = Pia) =) 
xX—a x—a 


For f(x) = e* and a = 0 this means that 
x) — Pi o(x . e—l—-x 
A NSE = lim ————— = 0. 


lim 
x x0 x 


10 


On the other hand, an easy double application of ’H6pital’s Rule shows that 


Thus, although f (+) — P},9(«) becomes small compared to x, as x approaches 0, it 
9 ~ . . . . - 
does not become small compared to x. For P2.9(x) the situation is quite different; 


? . . . . 
the extra term x*/2 provides just the right compensation: 


: 
x ] ae 
(5 —, > ex — ] Hy 
lim = = lim 
x0) r>0 aX 
x 
. et) 
= lim = () 
x0 Z 


12 
Xa (x —a)- 


in fact, the analogous assertion for P, 4 1s also true. 


THEOREM 1 Suppose that f is a function for which 


f (GB) hen we) 


all exist. Let 


and define 


Py a@(X) = ag +ay(x — a) +--+ +a,(x —a)". 


‘Then 
f(x) = Prca(%) 


Ya (x —a)" 
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PROOF 


Writing out P, 4(x) explicitly, we obtain 


n—| f(a) 


Ff) — = (x —a)' 
F(x) = Pra) _ 20: Ae f(a) 
(x-—a)" (x — a)" ni 
It will help to introduce the new functions 
n-l > 
f ll Ma ) 

O(x) = y Te —a)' and g(x)=(x—a)’; 

i=0 


now we must prove that 
f(x)— Q(x) f(a) 
x—a g(x) n! , 


Notice that 


O(a) = f(a), k<n-1, 


@ (x) =n\(x — a)" */(n —k)!. 


Thus 
lim [ f(x) — Q(x)] = f(a) — Q(a) = 0, 
lim [ f(x) — Q'(x)] = f'(a) — O(a) = 0, 
lim [ f(x) —Q" *)(x)] = f"— a) — Q"-?(a) = 0. 
and 
pe mae) Ea 0 


We may therefore apply /Hopital’s Rule n — 1 times to obtain 
‘ f(x) = O(x) ; FO Ye) — O@-D(x) 
im —_—— _ = im ———_- 
rsa (x —a)" ta ni(x —a) 
Since is a polynomial of degree n — 1, its (n — 1)st derivative is a constant: in 
pot) S 
fact, O"-Y(x) = f"-Y(a). Thus 
: f(x) = O(x) : ie Diy) = fa) 
lina AA Sc = bir RAR 
xa (x —a)" ta n'(x —a) 


and this last limit is f“"’(a)/n! by definition of f(a). § 


One simple consequence of ‘Theorem | allows us to perfect the test for local 
maxima and minima which was developed in Chapter 11. If a is a critical point 
of f, then, according to Theorem 11-5, the function f has a local minimum 
ata if f"(a) > 0, and a local maximum at a if f"(a) < 0. If f"(a) = 0 no 


(a) n odd 


(b) n even 


FIGURE 2 


THEOREM 2 


PROOF 
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conclusion was possible, but it is conceivable that the sign of f’’(a) might give 
further information; and if f’’(a) = 0, then the sign of fO@ =0 might be 
significant. Even more generally, we can ask what happens when 


(x) fila) = f(a) =--- = f" (a) = 0, 
7 May = 0. 


The situation in this case can be guessed by examining the functions 


FCO ee = 
Gr 


which satisfy (*). Notice (Figure 2) that if is odd, then a is neither a local 
maximum nor a local minimum point for f or g. On the other hand, if n is 
even, then f, with a positive nth derivative, has a local mimimum at a, while 
g, with a negative nth derivative, has a local maximum at a. Of all functions 
satisfying (*), these are about the simplest available; nevertheless they indicate the 
general situation exactly. In fact, the whole point of the next proof is that any 
function satisfying (*) looks very much like one of these functions, in a sense that 
is made precise by ‘Theorem I. 


Suppose that 


f'(a) a fe) =i) 
7 a) A 0. 


(1) If 7 is even and f(a) > 0, then f has a local minimum at a. 
(2) If n is even and f(a) < 0, then f has a local maximum at a. 
(3) If 7 is odd, then f has neither a local maximum nor a local minimum at a. 


There is clearly no loss of generality in assuming that f(a) = 0, since neither the 
hypotheses nor the conclusion are affected if f is replaced by f — f(a). Then, 
since the first n — | derivatives of f at a are 0, the ‘Taylor polynomial P,,q of f 1s 


/ (72) 
Oe ee 
i n! 


(x — a) 


a — a a Cara 


: f% (a) 
= ae 


n 


Thus, Theorem | states that 


fPlee) Sa) ic ya) me) 
CS Si || = ee Sk 


(x — a)" n! 


0 =ami 


xa (x — a)" xa 
Consequently, if x is sufficiently close to a, then 


fe) 


(x — a)” 


f™M@ 


has the same sign as 
n} 


Suppose now that 7 is even. In this case (x — a)" > O for all x 4 a. Since 
f (x)/(« —a)" has the same sign as f (a)/n! for x sufficiently close to a, it follows 
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that f(x) itself has the same sign as f"(a)/n! for x sufficiently close to a. If 
f(a) > 0, this means that 


f(x) > 0= fla) 


for x close to a. Consequently, f has a local minimum at a. A similar proof works 
for the case f(a) < 0. 
Now suppose that 2 is odd. ‘The same argument as before shows that if x is 


oat = eli y #0 sufficiently close to a, then 


Hf Y) 


always has the same sign. 
(x —a)" 2 


But (« —a)" > O for x > a and (x—a)" < O for x <a. Therefore f(x) has different 
signs for x > a@ and x <a. ‘This proves that f has neither a local maximum nor 
a local minimum at a. J 


Although Theorem 2 will settle the question of local maxima and minima for 
just about any function which arises in practice, it does have some theoretical 
f= “<— x70 limitations, because f“) (a) may be 0 for all k. This happens (Figure 3(a)) for the 
0, x=0 function 
. RA. eal 
(b) T@ = e [x : Xx x 0 
0, ie Eth 


which has a minimum at 0, and also for the negative of this function (Figure 3(b)), 
which has a maximum at 0. Moreover (Figure 3(c)), if 


eal x>0 
f(x) = 4 0, 0) 


Sol Pre 0. 


then f“ (0) = 0 forall k, but f has neither a local minimum nora local maximum 


=k 0 at O. 
(c) The conclusion of Theorem 1 is often expressed in terms of an important con- 
FIGURE 3 cept of “order of equality.” ‘Two functions f and g are equal up to order n 
at aif 


lim J Neat 810) ) 

fad eg)? 
In the language of this definition, Theorem | says that the Taylor polynomial 
Pra. equals f up to order n at a. ‘The Taylor polynomial might very well have 
been designed to make this fact true, because there is at most one polynomial of 
degree < n with this property. ‘This assertion is a consequence of the following 
elementary theorem. 


THEOREM 3 Let P and Q be two polynomials in (x — a), of degree < n, and suppose that P 
and Q are equal up to ordern at a. Then P = Q. 


proor Let R = P — Q. Since R is a polynonual of degree < n, it is only necessary to 


COROLLARY 


PROOF 
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prove that if 
R(x) = bo +--+ + by (x — a)" 
satisfies 
R(x) 


lim ———— = QO, 
x>a (xX — aye 


then R = 0. Now the hypothesis on R surely imply that 


. RO 
hm ————— 


- Oo OF; Bur 
x>a (x — a)! 


For i = 0 this condition reads simply lim R(x) = 0; on the other hand, 
Or af 2} 
lhim** () = limp ae = ay + Dax — a) | 
Ae X= a 
= bo. 


Thus bp = 0 and 
R(x) = bh (x —a) +--+» +. b,(x — 2)". 


Therefore, 


R x¢ 
= bj + box — a) + - Fb, (aw — ay"! 
x-—a 
and 
lim =e 
BoA Se = (i) 


Thus b; = 0 and 
R(x) = by(x — a)? +--+» 4b, (x — 2)". 
Continuing in this way, we find that 


bo =::-=b, =0.] 


Let f be n-times differentiable at a, and suppose that P is a polynomial in (x — a) 
of degree <n, which equals f up to order n at a. Then P = lca. f: 


pines P and PP. ¢ both equal f up to order n at a, it is easy to see that P equals 
Pna,f Up to order n at a. Consequently, P = Pya,¢ by the Theorem. | 


At first sight this corollary appears to have unnecessarily complicated hypotheses: 
it might seem that the existence of the polynomial P would automatically imply 
that f is sufficiently differentiable for P,.a.f to exist. But in fact this is not so. For 
example (Figure 4), suppose that 


ewe xt! y irrational 
i 0, X rational. 

If P(x) = 0, then P is certainly a polynomial of degree < n which equals f up to 
order n at 0. On the other hand, f’(a) does not exist for any a £0, so f’(0) is 


undefined. 
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x irrational 
x rational 


n+l 


7 x < 
x)= 
f(a 0. 


: 
. 
: 
. 
. 
. 
: 
: 
. 
: 
. 
Y 
* 
: 
: 
: 
. 
Pod 


FIGURE 4 
When f does have n derivatives at a, however, the corollary may provide a 
useful method for finding the ‘Taylor polynomial of f. In particular, remember 
that our first attempt to find the Taylor polynomial for arctan ended in failure. 
x | 
arctan x =| ——= dt 
9 1+ 
divide | 


The equation 


suggests a promising method of finding a polynomial close to arctan 
(1 yrtl p2n+2 


by 1 + 17, to obtain a polynomial plus a remainder: 
2 A 6 n,2n 
—— = 1-4 -t+---+(-1)"t" + ; 
Let Ee eat Se 
rmi: . . . . . - 9 
This formula, which can be checked easily by multiplying both sides by | + f°, 
shows that 
x . 5 X pont2 
arctan x = / Leh an yd eae / ——~ dt 
0 g;) Late 
x? ro ‘ x2nt ! a x ten +2 
=x—- > +2 --+-+(-1)"—— +(-]) ——— dt. 
3 5 2n + | Jo 1+t- 
According to our corollary, the polynomial which appears here will be the ‘Taylor 
polynomial of degree 2n + | for arctan at 0, provided that 
¥ p2nt2 
~ dt 
o 1+t- 
lim ; =0 
xen 


-{2n+3 

\ NE ejen 
fae lic TT Bi 

0 2n+3 


Since 
r 2) 
/ ~dt| < 
Oo Le ‘ 
this is clearly true. ‘Thus we have found that the ‘Taylor polynomial of degree 


.2n+l 


ei i ——, 
2n + | 


2n + 1 for arctan at O is 
a) 

x 
5 


3 
= ip 
Pon's 1.0(X) =x— 3 + 
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By the way, now that we have discovered the ‘Vaylor polynomials of arctan, it is 
possible to work backwards and find arctan”) (Q) for all k: Since 


x3 .2n+l1 
|e 2n = i on - —1)"-= 
2 ee) — 3 ar 5 Se (| See om ere 


and smee this polynomial is, by definition, 


(23 
arctan '~)(Q) 3 arctan 2”+)(Q) 


are (Oe aarctan (0) + =" 7-7 ee Ee 
2! (2n+ 1)! 
we can find arctan (0) by simply equating the coefficients of x* in these two 
polynomials: 
arctan) (0) aa 
———_— =() if k iseven, 
k! 
A 2!+1) 
arctan! (O) a) 
SS ae rs arctan | O) = (alae Ce 
(21+ 1)! 2/+ 1 


A much more interesting fact emerges if we go back to the original equation 


x3 Oo yet i X p2n+2 
arctanx = x — — + > —---+(—1)"—— 4+ (- 1)" i ———— 1} 
: 715 on alge ge He 


ae Lage 
< ——. 
[ pg rire. ~ In+3 


When |x| < 1, this expression 1s at most 1/(2” + 3), and we can make this as 
small as we lke simply by choosing n large enough. In other words, for |x| < | 
we can use the Taylor polynomials for arctan to compute arctan x as accurately as we like. 


and remember the estrmate 


The most mportant theorems about Taylor polynomials extend this tsolated result 
to other functions, and the ‘Taylor polynomials will soon play quite a new role. 
The theorems proved so far have always examined the behavior of the ‘Taylor 
polynomial P,.¢ for fixed n, as x approaches a, Henceforth we will compare Taylor 
polynomials Pq for fixed x, and different n. In anticipation of the comimg theorem 
we mtroduce some new notation. 

If f is a function for which Py.q(x) exists, we define the remainder term 

Rya() by 


f(x) 5 Pn ake nee (x) 


>(72) 
= f(a)t+ f(a(e —a) +--+ + ae ie eel ava), 


We would like to have an expression for Ry a(x) whose size is easy to estimate. 
There is such an expression, volving an integral, just as m the case for arctan. 
One way to guess this expression ts to begin with the case nr = 0: 


F(x) = fla) + Roa(x). 
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The Fundamental Theorem of Calculus enables us to write 


f(x) = fla) + ia f(t) dt, 


so that 


x 
af 
Rowa(t) = / fat. 
a 

A similar expression for Rj 4(%) can be derived from this formula using integra- 

tion by parts in a rather tricky way: Let 

u(t)= f(t) and v(t)=t-—x 

(notice that x represents some fixed number in the expression for u(t), so v'(t) = 1); 


then . : 
/ par= | Ff 1 dt 
a a ap { 


u(t) v(t) 


- [fae aat 
a a { 1 


u(t) v(t) 


u(t)v(t) 


Since v(x) = 0, we obtain 


f= fas f rand 
= f(a) —u(a)v(a) +/ f'(t)\(x —t)dt 


= f(a) + f'(a\(x —a)+ i f'(t)(x —t)dt. 


‘Thus 


Ri g(x) = o Famtales —t)dt. 


It is hard to give any motivation for choosing v(t) = tf — x, rather than v(t) =. 
It just happens to be the choice which works out, the sort of thing one might 
discover after sufficiently many similar but futile manipulations. However, it is 
now easy to guess the formula for R2,(x). If 
—(% = t)? 
—_—— 


- 


x x er —(x Fs t)? 
-| f >, at 


PMOC=ar, FP Oe. oo 


na) =} @) and 2@)-= 


then v’(t) = (x — ft), so 


u(f)u(t) 


[ f' (t)(x —t)dt 


This shows that 


- 


> (3) 
t) 
Ro q(x) — i S S (XY = 1) dt. 


_ 


You should now have little difficulty giving a rigorous proof, by induction, that 


LEMMA 


PROOF 
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if f+ is continuous on fa, x], then 
EG fDi) 
Ras) = f pe ae 
} n! 


This formula is called the integral form of the remainder, and we can easily esti- 
mate it in terms of estimates for ff") /nt on [a,x]. If mand M are the minimum 
and maximum of f@+) /n! on [a,x], then Ry @(x) satisfies 


ae x 
mf fa 1) df = Ry) = uf (x —1)dt, 
a a 
sO we can write 
(nee ajtt! 


n+1 


for some number @ between m and M. Since we've assumed that ft) is con- 
tinuous, this means that for some ¢ in (a, x) we can also write 


lee (x) = re 


(n+1) 1 2 WEEE (n+1) 
USA Be) ea f o fis pa —_- = Jas (x — aa, 
n! n+1 (n+ 1)! 


which ts called the Lagrange form of the remainder (these manipulations will look 
familiar to those who have done Problem 13-23). 

The Lagrange form of the remainder is the one we will need in almost all 
cases, and we can even give a proof that doesn’t require f’* to be continuous 
(a refinement admittedly of little importance in most applications, where we often 
assume that f has derivatives of all orders). ‘This is the form of the remainder that 
we will choose in our statement of the next theorem (Taylor’s Theorem). 


Suppose that the function R is (n + 1)-times differentiable on [a,b], and 
R“(a@)=0 fork=0, | Bee Ae ae 
Then for any x in (a, b] we have 


R(x) RG 
=, < ae me comer neq 
(x —a)rtl (n+ 1)! 
For n = 0, this is just the Mean Value T heorem, and we will prove the theorem 
for all n by induction on n. To do this we use the Cauchy Mean Value Theorem 
to write 


Re) FC, es 


Oa (n+2)c¢—aytl ® 1 ee a)"+1 


for some z in (a,x), 


and then apply the induction hypothesis to R’ on the interval la, z] to get 


R(x) eB, rin ¢ 
oor = oa for some t it (a, z 
@—ay't2 nt2 @+)) . 

Rep) 


7 (n +2)! 
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THEOREM 4 (TAYLOR’S THEOREM) Suppose that f’,..., f"* are defined on [a,x], and that Ry q(x) is defined by 


: , f(a) ° 
f(x) = fla) + f (@(4 —a)+---+ | (* — a)" + Rasa (x): 
Then 
1) (4) 
, Neate 8: ; 

Regi — wine —a)"*! for some t in (a,x) 
n ! 
(Lagrange form of the remainder). 

PROOF The function R,4 satisfies the conditions of the Lemma by the very definition of 


the ‘Taylor polynomial, so 


Rasa) RG 
(x—ay"t! (n+ 1)! 


for some f in (a,x). But 


(n+l) _ g¢(n+1) 
Rn. =f 


since Ry@ — f isa polynomial of degree n. J 


Applying Taylor’s Theorem to the functions sin, cos, and exp, with a = 0, we 
obtain the following formulas: 


3 Bs) -2n+1 +. (2n+2) 
F x X . sin (hi)! <oscea 
OT ee ee eS (ees [ee nse ee leis 
3! 5! (2n + 1)! (2n +2)! 
-2 4 _2n (2n+1) 
Xx Xx 3 COS (ft) In+1 
COS ee a ee re 
2 2" 4! (Qn)! (n+)! 
2 n 
a ; e n+l 


ee 
2! ni! (n+ 1)! 
(of course, we could actually go one power higher in the remainder terms for sin 


and cos). 
Estimates for the first two are especially easy. Since 


a 2) ? ~ 
lsin“"*(r)| < 1 for all t, 


we have 


- _ (2n+2) 2n+2 
1 Neidio eee rs 
xr“) < ——__., 
(2n + 2)! (2n + 2)! 
Similarly, we can show that 
9 ? 
cos'2"t) (pr) ree |x jen! 


aay gee 
(2n + 1)! —~ (2n+ 1)! 
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These estimates are particularly interesting, because (as proved in Chapter 16) for 
any € > 0 we can make 


by choosing n large enough (how large n must be will depend on x). This enables 
us to compute sinx to any degree of accuracy desired simply by evaluating the 
proper ‘Taylor polynomial P,o(x). For example, suppose we wish to compute 
sin 2 with an error of less than 10-4, Since 


2n+2 


sin2 = P3,41,0(2) +R, where |R] < Qaai 


we can use P2,41 9(2) as our answer, provided that 
2n+2 

pee es 

(2n + 2)! 


A number n with this property can be found by a straightforward search-—it ob- 
viously helps to have a table of values for n! and 2” (see page 432). In this case it 
happens that n = 5 works, so that 
sin 2 = Pi1.9(2) +R 
93 ae at 29 me) 


—2> 4 eR, 
ssl a op eo 
where |R| < 107+. 
It is even easier to calculate sin | approximately, since 
inl = Popig wn eR here |R| 
sin P54 , Where < ——_., 
; ee (Qn +2)! 


To obtain an error less than € we need only find an n such that 


| 
(roe © 


and this requires only a brief glance at a table of factorials. (Moreover, the indi- 
vidual terms of P2,,1 9(1) will be easter to handle.) 


For very small x the estimates will be even easier. For example, 


| 


102*+2(2n + 2)! 


I | 
sin 10 = Prn41,0 (=) - RK, where|R| < 


To obtain [R| < 107! we can clearly take n = 4 (and we could even get away 
with n = 3). These methods can actually be used to compute tables of sin and 
cos; a high-speed computer can compute P,4).9(x) for many different x in almost 
no time at all. Nowadays, computers, and even cheap calculators, determine the 
values of such functions “on-the-fly”, though by specialized methods that are even 
faster. 
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Estimating the remainder for e* is only slightly harder. For simplicity assume 
that x > O (the estimates for x < 0 are obtained in Problem 15). On the interval 
[O, x] the maximum value of e' is e*, since exp is increasing, so 


exxrt l 


06 
(n+ 1)! 

Since we already know that e < 4, we have 

et yitl 4r yt 

ec tae a << Ee ic 

(1+ 1)! (n+ 1)! 
which can be made as small as desired by choosing 7 sufhciently large. How large 
n must be will depend on x (and the factor 4° will make things more difficult). 
Once again, the estimates are easier for small x. If 0 < x < |, then 


2 nN 


BR x 
=l+x+—4+---+—4+R, where 0 < R < ———_. 
: : 2! n! ’ (n+ 1)! 
In particular, if n = 4, then 
OR : 
< —<—, 
os) 0 
so 
=, je Ee Ae RS 
e=e =I1+ a> Sie , Where OU < =a 
24 
which shows that 
2 Gas 
(This then shows that 
3x ntl 
O -=R = = 
(1+ 1)! 
allowing us to improve our estimate of R slightly.) By taking n = 7 you can 


compute that the first 3 decimals for e are 
2 ahs ows 


(you should check that 1 = 7 does give this degree of accuracy, but it would be 
cruel to insist that you actually do the computations). 

The function arctan is also important but, as you may recall, an expression for 
arctan (x) is hopelessly complicated, so that our expressions for the remainder 
are pretty useless. On the other hand, our derivation of the ‘Taylor polynomial for 
arctan automatically provided a formula for the remaimder: 

x rea yry ent (| yrtl pant2 
+O f 
0) 


AVGtall y — 1 ee oe 
3 2n + ] | + 12 


dt. 
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As we have already estimated, 


x peel yee x F 
Gal) 5 |x 
eee et = ponte gy SS =. 

0 }1+t- 0 2n +3 


2n+3 


For the moment we will consider only numbers x with |x| < 1. In this case, the 
remainder term can clearly be made as small as desired by choosing 7 sufficiently 
large. In particular, 


|e (-1)” l 
arctan] =1—=+—2--:: +R, where |R| < ——~. 
2n+3 


a 35 2n +1 
With this estimate it is easy to find an n which will make the remainder less than 
any preassigned number; on the other hand, # will usually have to be so large as to 
make computations hopelessly long. To obtain a remainder < 10~7, for example, 
we must take n > (107 — 3)/2. This is really a shame, because arctan 1 = 7/4, 
so the ‘Taylor polynomial for arctan should allow us to compute z. Fortunately, 
there are some clever tricks which enable us to surmount these difficulties. Since 
| ‘ [2n+3 


Ro (x)| < ———~, 
| Ron+1,0(%)| m3 


much smaller #’s will work for only somewhat smaller x’s. The trick for computing 
7m is to express arctan | in terms of arctan x for smaller x; Problem 6 shows how 
this can be done in a convenient way. 
From the calculations on page 413, we see that for x > 0 we have 
3) d — 
x2 ro x4 (1) yn (—1)" ne 

log] +x) =x —- ~— + — — — 4+ --- 4+ ——— + —— I 
2 6 + H nl 
where 

=i VE yntl 

ay) yet eo x 


eel ~ant+ di 


and there is a slightly more complicated estimate when —1 < x < 0 (Problem 16). 
For this function the remainder term can be made as small as desired by choosing 
n sufficiently large, provided that —1 <x < 1. 

The behavior of the remainder terms for arctan and f(x) = log(x + 1) is quite 
another matter when |x| > 1. In this case, the estimates 


|x |2n+3 
Ron+1,0(4)| < ——~ _ for arctan, 
| n+], | a + 3 
xhtl 
[Rno(x)| < Feiy (x > 0) for fe 


are of no use, because when > | the bounds x”/im become large as m be- 
comes large. This predicament is unavoidable, and is not just a deficiency of our 
esumates. It is easy to get estimates in the other direction which show that the 


remainders actually do remain large. ‘To obtain such an estimate for arctan, note 


xX 
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thatiier ism (0) )s(orem: [a5 Ot x <0), then 


Er ae RE nc SI 
2n+1 


ae 1 Xx x 
is > / cruel ae 
0 14+f =F C 246 | Jo 4n+6 


‘To get a similar estimate for log(] ++), we can use the formula 


1 9 = = (—1)"t" 

ee 97 es ae =" ” ; 

Es. - +(-1 ag 

to get 
x] x Z x 3 x 
log(1 so) = i= lr hee a eee 
og(l + x) fo Mee pleiea + (-1) = 
+(-—1)" ‘ea; 
rae i ee 


If x > 0, then for ¢ in [0, x] we have 


l+4 See = 


[ r” = | ic Fi = yaa 
— ar= 
0 ae ek 0 oD: 


These estimates show that if |x| > 1, then the remainder terms become large as 
n becomes large. In other words, for > 1, the ‘Taylor polynomials for arctan 
and f are of no use whatsoever in computing arctan x and log(x + 1). This is no tragedy, 
because the values of these functions can be found for any x once they are known 
for all x with |x| < 1. 


so 


Xx 


This same situation occurs in a spectacular way for the function 


anes alae es 
fia = 0. an 


We have already seen that f“)(O) = 0 for every natural number k. This means 
that the Taylor polynomial P,.9 for f is 


"O) > “OO 
SOWA IO, 


P, o(x) = 
=) 


n! 


In other words, the remainder term R,9(¥) always equals f(x), and the ‘Taylor 
polynomial is useless for computing f(x), except for x = 0. Eventually we will be 
able to offer some explanation for the behavior of this function, which is such a 
disconcerting illustration of the limitations of Vaylor’s Theorem. 

The word “compute” has been used so often in connection with our estimates 
for the remainder term, that the significance of Taylor’s Theorem might be mis- 
construed, It is true that ‘Vaylor’s ‘Theorem can be used as a computational aid 


THEOREM 5 


PROOF 
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(despite its ignominious failure in the previous example), but it has even more im- 
portant theoretical consequences. Most of these will be developed in succeeding 
chapters, but two proofs will illustrate some ways in which ‘Taylor’s Theorem may 
be used. The first illustration will be particularly impressive to those who have 
waded through the proof, in Chapter 16, that z is irrational. 


é is irrational. 


We know that, for any n, 


l | | : 
a0 SA ae en Re, «where Oe, = — 
it 2! n! (n+ 1)! 
Suppose that e were rational, say e = a/b, where a and b are positive integers. 
Choose n > b and also n > 3. Then 


a 
so 
nla n! n! 
Se Te pe nly, 
b 2! n! 


Every term in this equation other than n!R,, is an integer (the left side is an integer 
because n > b). Consequently, n!R,, must be an integer also. But 
0 <R =n 
<R, < ; 
(n + 1)! 


so 


0 <n'!R, < 
n 


which is impossible for an integer. § 


The second illustration is merely a straightforward demonstration of a fact 
proved in Chapter 15: If 


f" + f =9, 
f(0) =0, 
f'(0) = 0, 


then f = 0. To prove this, observe first that f\) exists for every k; in fact 


f® = Cf’ = ey 
f® = (fy! = (—f’) == —f” — f. 


f = Fy’ = f q 


etc. 
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This shows, not only that all f“ exist, but also that there are at most 4 different 
ones: f, f’, —f, —f'. Since fC) = f’() = 0, all f (©) are 0. Now Taylor’s 


Theorem states, for any n, that 


Ak ft ne) é 
eo — EUS (x —a) 


: Z “5 : ( 6 y 5 P 
for some t in {0, x]. Each function f*) is continuous (since f+) exists), so for 
any particular x there is a number M such that 


peter <M “for 0 =a = x, andall 7 


(we can add the phrase “and all n” because there are only four different f“). 


Thus 


M\x leer 
Sf CHa 
(n+ 1)! 
Since this is true for every 7, and since |x|"/n! can be made as small as desired by 
choosing n sufficiently large, this shows that | f(x)| < ¢ for any e > 0; consequently, 


fF =. 


The other uses to which ‘Vaylor’s ‘Theorem will be put in succeeding chapters 
are closely related to the computational considerations which have concerned us 
for much of this chapter. If the remainder term R,.¢(x) can be made as small as 
desired by choosing n sufficiently large, then f(x) can be computed to any degree 
of accuracy desired by using the polynomials P,¢(x). As we require greater and 
greater accuracy we must add on more and more terms. If we are willing to add 
up infinitely many terms (in theory at least!), then we ought to be able to ignore 
the remainder completely. ‘here should be “infinite sums” like 


ee oe yt 
ee 4, 1b a> ap 
oe ee | yo re 
COS X = my ka ol mh 
es 2 x3 x4 
a — ot ot 3a ae” 
a ye 7 
x : ; 
BTC oom = if og! <= We 
x2 x yx 
ad =_'> paeeieee  b  e 


We are almost completely prepared for this step. Only one obstacle remains 
we have never even defined an infinite sum. Chapters 22 and 23 contam the 


necessary definitions. 
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PROBLEMS 


Ii 


Find the ‘Taylor polynomials (of the indicated degree, and at the indicated 
point) for the following functions. 


(i) f(x) =e* ; degree 3, at 0. 

(ii) f(x) =e™* degree 3, at 0. 

(ii) sin; degree 2n, at = 

(iv) cos; degree 2n, at 7. 

(v) exp; degree n, at 1. 

(vi) log; degree n, at 2. 

(vii) f(x) =x° +x? +x; degree 4, at 0. 
(viii) f(x) =x? +x7 +x; degree 4, at 1. 


I 
(ix) f(x) = ——; degree 2n + 1, at 0. 


l+x 


Gx} FG) = 


: degree n, at 0. 


l+x 


Write each of the following polynomials in x as a polynomial in (x — 3). (It 
is only necessary to compute the ‘Taylor polynomial at 3, of the same degree 
as the original polynomial. Why?) 


Gj) x*—4x-—9, 
Gi) x* — 12x34 44x? 4 2x +1. 
(iii) x. 


(iv) ax*+bx+c. 


Write down a sum (using ) notation) which equals each of the following 
numbers to within the specified accuracy. To minimize needless computa- 
tion, consult the tables for 2” and n! on the next page. 


(i) sin l; error < 107!’. 
(ii) sin 2; error < 107!?. 
(ii) sin 5; error < 10772. 
(iv) e; error < 1074. 


(vy) e*: error < 107°. 
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n rie n! 
] 2 l 
2 4 2 
3 8 6 
4 16 94 
5 2 120 
6 64 720 
| 128 5,040 
8 256 40,320 
9 S12 362,880 
10 1,024 3,628,800 
11 2,048 39,916,800 
2 4,096 479,001,600 
13 8,192 6,227,020,800 
14 16,384 87,178,29 1,200 
15 Bo . 166 1,307,674,368,000 
16 65,536 20,922,789,888,000 
7 O72 355,687,428 ,096,000 
18 262,144 6,402 ,373,705,728,000 
19 524,888 121,645, 100,408,832,000 
20 1.048.576 2 432,902,008, 176,640,000 


*4, ‘This problem is similar to the previous one, except that the errors demanded 
are so small that the tables cannot be used. You will have to do a little 
thinking, and in some cases it may be necessary to consult the proof, in 
Chapter 16, that x”/n! can be made small by choosing n large - the proof 
actually provides a method for finding the appropriate n. In the previous 
problem it was possible to find rather short sums; in fact, 1t was possible 
to find the smallest 2 which makes the estimate of the remainder given by 
‘Taylor’s Theorem less than the desired error. But in this problem, finding any 
specific sum is a moral victory (provided you can demonstrate that the sum 
works). 


. . 10 
i) sin |; /crrort= 10>00">, 


oe 


iiyeePerror ale | OOS 


coe jal 30, 


( 

( 

(iii) sin 10; error < 10-79, 
(iv) error < 107 
( 


v) arctan 1; error < 10710"), 

5. (a) In Problem 11-41 you showed that the equation x* = cos.x has pre- 
cisely two solutions. Use the third degree ‘Taylor polynomial of cos to 
show that the solutions are approximately +,/2/3, and find bounds on 
the error. ‘Then use the fifth degree ‘Taylor polynomial to get a better 
approximation. 

(b) Similarly, estimate the solutions of the equation 2x? = x sin.x + cos? y. 
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(a) Prove, using Problem 15-9, that 


XN i ] 

— = arctan — + arctan —, 

4 D 3 

XN i | | 
— = 4 arctan = — arctan —. 
4 5 239 


(b) Show that 7 = 3.14159.... (Every budding mathematician should ver- 
ify a few decimals of 2, but the purpose of this exercise is not to set you 
off on an immense calculation. If the second expression in part (a) is 
used, the first 5 decimals for 7 can be computed with remarkably little 
work.) 


Suppose that a; and b; are the coefficients in the Taylor polynomials at a of f 
and g, respectively. In other words, a; = f(a)/i! and bj = g®(a)/i!. Find 
the coefficients c; of the Taylor polynomials at a of the following functions, 
in terms of the a;’s and b;’s. 


(af see 
ce may. 
Gi) of 


(iv) nix) = f f(t) dt. 
(v) kor = f f(tydt. 
0 


(a) Prove that the Taylor polynomial of f(x) = sin(x*) of degree 4n +2 at 0 
is 


: 5) 
, x 10 yinte 


woman st!) Gh! 


Hint: If P is the Taylor polynomial of degree 2n + 1 for sin at 0, then 
six = P(x) R(x) where lim R(x)/x?"*+! = 0. What does this imply 


about lim R(x2)/x7"+2? 


(b) Find f(0) for all k. 
(c) In general, if f(x) = g(x”), find f™(0) in terms of the derivatives of g 


at O. 


x 


The ideas in this problem can be extended significantly, in ways that are explored 


in the next three problems. 


ve 


(a) Problem 7 (i) amounts to the equation 
Pua, f-+e2 — Pua, f +e Py gee 
Give a more direct proof by writing 


ie) SS Wa Si (eo) Rava, f (x) 
g(x) = Praise (x) = Ria.g Ce), 


and using the obvious fact about Rya.¢ + Raa.e- 


434 Infinite Sequences and Infinite Series 


10. 


11. 


(b) Similarly, Problem 7 (11) could be used to show that 


Pra, fe = eeu z ler nab 


where | P],, denotes the truncation of P to degree n, the sum of all 
terms of P of degree < n [with P written as a polynomial in x — a]. 
Again, give a more direct proof, using obvious facts about products in- 
volving terms of the form R,. 
(c) Prove that if p and g are polynomials in .«—a and lim R(x)/(x—a)" = 0, 
then ie 
p(q(x) + R(x)) = p(q(x)) + RW) 


where 
hm ROW a) 


Also note that if p is a polynomial in x — a having only terms of degree 
> n, and q 1s a polynomial in x — a whose constant term is 0, then all 
terms of p(q(x — a)) are of degree > n. 

(d) lia Oxindko—"2(ar= Oilien 


Pi afes = aes. o eotreene 


(Problem 8 is a special case.) 
(e) ‘The same result actually holds for all a and any value of g(a). Hint: 
Consider F(x) = f(x+g(a)), G(x) = g(x +a) and AH (x) = G(x)— g(a). 
(i) eLh (a) Omen 


Prats = [! a Prag a (ne We as Renae 


For the followmg applications of Problem 9, we assume a = O for simplicity, 
and just write P,,¢ mstead of Py a, 5. 


(a) Bor 9 Gols eames (x) = sinix, md 2s (a), 
(b) For the same f and g, find P5, rg. 
(c) sind Pstanx). flint: First use Problem 9°(f) andthe value of 5,.¢,(4)te 
mS) a 
find P54 cog mene iswer: Xx + = + =) 
(d) Find Py, for f(x) = e2* cosx. (Answer: 1 + 2x + 3x? + ix? _ 44x") 
(e) Find Ps} for f@) =sin x/ cos 2x. (Answer: x + ix + 30h x9) 


(f) Kind Pe! ploy Gi x?/[C!+x2)e*]: (Answer: x2 — x7 — 5x + > x°) 


Calculations of this sort may be used to evaluate limits that we might other- 
wise try to find through laborious use of PH6pital’s Rule. Find the following: 
: 9) 
C= | ee . N(x) 


(a) laa ————-——. = _ Im —., 
56 x —sm7x x30 D(x) 


Hint: First find P39.y(¥) and P3.9,p(x) for the numerator and denomi- 
nator N(x) and D(x). 


12. 


13: 


14. 
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- +x = 
(b) lim LS, 
x0 x—sinx 


Hint: For the term e*/(1+.), first write 1/(1 +x) = l—x+x?—x3+-. 


ey ij ( | ] ) 
(c) lim — — }, 
x>0\sin?x x? 


| — cos(x”) 


(ae ins = 
x30 x-sin’ x 


; l 
(e) lim = = So 
r>0 sin“ x  sin(x*) 


. .  (sinx)(aretan x) — x? 
(f) bay —<—_$ ————  —<§— 


x0 1 — cos(x2) 


Let ; 
sin x 


x #0 
l, x =0. 


Starting with the Taylor polynomial of degree 2n + | for sin x, together with 
the estimate for the remainder term derived on page 424, show that 


a= 


2 4 .2n 
x x ” % y -- 
fH (1— 3) + 5 ++~ + (—]) Qn +1! + Ron.0, 6 (*) 
where 
“ J : [x |r! 
an x)| << ————_, 
ans0.F (2n + 2)! 


and use this to conclude that 


l | 2 4 
er. x 1703 
ae Se a Nae Gan 
if f [ 3c ° =) = Fe00 


with an error of less than 107°. 


Let 


Pie.) x 


(a) Find the Taylor polynomial of degree n for f at 0, compute f“) (0), and 
give an estimate for the remainder term R,, 0. ;. 


| 
(b) Compute | f with an error of less than 107+. 
0 


0.1 
ae > 5 : 5 
Estimate / exp(x~) dx with an error of less than 10 5. 
0 
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1S: 


16. 


“17. 


18. 


19: 


Prove that if x <0, then the remainder term R,, 9 for e* satisfies 


aut 


R, = ira Ea 
Knol (n + 1)! 


Prove that if —1 < x < O, then the remainder term R,.9 for log(1 +x) 
satishes 
ae 


Gy Se ee 
0 aca cari 


(a) Show that if |g’(x)| < Mlx — a|" for |x — a| < 6, then |g(x) — g(a)| < 
Myo al a: 1) fondle 


(b) Use part (a) to show that if lim g’(x)/(« — a)" = 0, then 


ce St — 8(G) 
lim a 
x>a (x —a y+ 


(c) show tiie) = f(x) — PF; ayy (x), then 2 Oy — yee 


(d) Give an inductive proof of Theorem 1, without using ?H6pital’s Rule. 


Deduce Theorem 1 as a corollary of ‘Taylor’s Theorem, with any form of 
the remainder. (The catch is that it will be necessary to assume one more 
derivative than in the hypotheses for Theorem 1.) 


Lagrange’s method for proving ‘Taylor’s Theorem used the following device. 
We consider a fixed number x and write 


pow 
| 


ny 


) fy) + fO@—H)+---4 Ca 0)" + ISR) 

for S(t) = Ry.(x). The notation is a tip-off that we are going to consider the 
right side as giving the value of some function for a given f, and then write 
down the fact the derivative of this function 1s 0, since it equals the constant 
function whose value is always f(x). ‘To make sure you understand the roles 
of x and r, check that if 


(k) 
(f ; 
t= ae L Ee 
then 
“(k) (k+1) t 
(t= J ak =m (o) + es =): 
(k) (k+1) 
fe! W) (c—s) Pee era 


(k — 1)! k} 


20. 


20. Approximation by Polynomial Functions 437 


(a) Show that 


O= f+ [=r + J wie - 7) 


a 
i ~ te TO ae —1) 
oRey 
ao ue Stee al eae 
+ S'(t), 
and notice that everything collapses to 
S(t)=— ue = 7)". 


Noung that 
S(x) = ins (x) = 0, 


S(a) = Raia (x), 


apply the Cauchy Mean Value Theorem to the functions S and h(t) = 
(x —t)"*! on [a,x] to obtain the Lagrange form of the remainder 
(Lagrange actually handled this part of the argument differently). 

(b) Similarly, apply the regular Mean Value Theorem to S to obtain the 
strange hybrid formula 


7 FUT 


Ria (x) 
Ny 


(x —t)"(x —a). 


This is called the Cauchy form of the remainder. 


Deduce the Cauchy and Lagrange forms of the remainder from the integral 
? 5 fa) 5 
form on page 423, using Problem 13-23. There will be the same catch as nn 


Problem 18. 


I know of only one situation where the Cauchy form of the remainder is used. 


The next problem is preparation for that eventuality. 


20. 


kor every number a, and every natural number n, we define the “binomial 


(j)-—— re 


coefficient” 


n!} 


: a f c Ce \e: 
and we define te = I, as usual. If @ is not an integer, then ( ) is never O, 
n 
and alternates in sign for > a. Show that the Taylor polynomial of degree n 


for? Gal — (hp a) ae s8Pry) GL | ee and that the Cauchy and 
k=0 


Lagrange forms of the remanider are the following: 
5 5 5 
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228 


Zoe 


Cauchy form: 


a(a—l1)-...- ao — ny 
R,0@ => se OT Ji ete 
ee )) Oe) a eee 
n! | re 


Xx af n 
(n+ (, i ,)x« 49) () _ ee fae(Oe) om, 0): 


Lagrange form: 
a{a—1)-...-(@a—n) \ ae | 
R,, a erat | 1)% n 
(x) errs cata CS) 


= (, , jer Eeyore PaO lor tie (0). 


Estimates for these remaimder terms are rather difficult to handle, and are 
postponed to Problem 23-21. 


(a) Suppose that f is twice differentiable on (0, 00) and that |f(x)| < Mo 


for all x > O, while |f”(x)| < Mo for all x > 0. Use an appropriate 
Taylor polynomial to prove that for any x > 0 we have 


Z } 
mio = ; Mo ~ <M) for all h > O. 
1 8) 


(b) Show that for all x > 0 we have 


If'(x)| <2 MoM). 
Hint: Consider the smallest value of the expression appearing in (a). 
(c) If f is twice differentiable on (0,00), f” 1s bounded, and f(x) ap- 
proaches 0 as x — oo, then also f’(x) approaches 0 as x > oo. 
(d) If dm f (x) exists and dm f (OOhemisteatinen Jim eS iim ioe 
0. (Compare Problem 11-34.) 


(a) Prove that if f”(a) exists, then 


} . flath)t+ fla—h) -—2 fla) 
f'(a~) = kh A. 
h—0 he 
The limit on the right is called the Schwarz second derwative of f ata. Hint: 
Use the Taylor polynomial P) q(x) with x = a+ and with x =a —h. 
(b) Let #@.5— x? for x > 0, and —x? for x < 0. Show that 


» {0+ 'h) + F(0— fh) — 270) 
aS =— 


exists, even though f”(0) does not. 
(c) Prove that if f has a local maximum at a, and the Schwarz second 
derivative of f at @ exists, then it is < 0. 


24. 


Aas 


o226, 
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(d) Prove that if f’"(a) exists, then 


ae ill) eee ae a a. 
3 h-0 he 


Use the ‘Taylor polynomial P;,,¢, together with the remainder, to prove a 
weak form of Theorem 2 of the Appendix to Chapter 11: If f” > 0, then 
the graph of f always les above the tangent line of f, except at the point 
of contact. 


Problem 18-43 presented a rather complicated proof that f = Oif f”—f = 0 
and f(0) = f’(0) = 0. Give another proof, using ‘Taylor’s Theorem. (This 
problem is really a prelimimary skirmish before doing battle with the general 
case in Problem 26, and is meant to convince you that Taylor’s Theorem is 
a good tool for tackling such problems, even though tricks work out more 
neatly for special cases.) 


Consider a function f which satisfies the differential equation 


n—1 


ie _ > apm 


j=0 


for certain numbers dg, ...,@,—1. Several special cases have already received 
detailed treatment, either in the text or in other problems; in particular, we 
have found all functions satisfying f’ = f, or f” + f = 0, or f”— f =0. The 
trick in Problem 18-42 enables us to find many solutions for such equations, 
but doesmt say whether these are the only solutions. ‘This requires a unequeness 
result, which will be supplied by this problem. At the end you will find some 
(necessarily sketchy) remarks about the general solution. 


(a) Derive the following formula for ft (let us agree that “a_,” will be 0): 


n—1 


pl = pee + an—jaj)f™. 


j=0 


. ~ ) 
(b) Deduce a formula for fT). 


The formula in part (b) is not going to be used; it was inserted only to con- 
vince you that a general formula for f"* is out of the question. On the 
other hand, as part (c) shows, it is not very hard to obtain estimates on the 
size"or f(x). 


(c) LeteV¥ = maxGlalagly.... lai) hen (e,4 4 a, 14) = 2N?; this 
means that 
n=l 
fede ee Pe where 


ba") 


b'| < 2N?. 
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(€) 


Show that 


n—| 
AG Sal 2 
fur) = yer: where [b;7| < 4N?. 
aay 


and, more generally, 


n—| 
pte = y bi fF, where [b\*| aN 
j=0 


Conclude from part (c) that, for any particular number x, there is a 
number M such that 


Lf" Ox)| < M-2*N! for all k. 
Now suppose that f(0) = f’(0) = --- = f~"(0) = 0. Show that 
M . Qk+T yk+2 | yin tktl 2 M.: (2N.x ete! 
(n+k-+ 1)! ~ (n+k+1)! 


and conclude that f = 0. 
Show that if f; and f> are both solutions of the differential equation 


|f(x)| < 


n—| 


fv _ a fF? 
j=0 
and f;/)(0) = f2/(0) for 0 < j <n —1, then f| = fr. 


In other words, the solutions of this differential equation are determined 
by the “initial conditions” (the values f)(0) for 0 < j <n — 1). This 
means that we can find a// solutions once we can find enough solutions 
to obtain any given set of initial conditions. If the equation 


/ 


x" — a,-,x" ° — +++ -—ag =O0 
has n distinct roots a@,...,@,, then any function of the form 
f(x) =cye™™ + +++ + ene" 


is a solution, and 


f(O) QC) ease Gn: 
f'(O) = A|C| + +! -F'Qnln, 


f"-YO) = ay" 'e ae digg the... 


As a matter of fact, every solution is of this form, because we can obtain 
any set of numbers on the left side by choosing the c’s properly, but we 
will not try to prove this last assertion. (It is a purely algebraic fact, which 
you can easily check for n = 2 or 3.) These remarks are also true if some 


wie EE 
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of the roots are multiple roots, and even in the more general situation 
considered in Chapter 27. 


Suppose that f is a continuous function on [a,b] with f(a) = f(b) 
and that for all x in (a,b) the Schwarz second derivative of f at x is 0 
(Problem 23). Show that f is constant on [a,b]. Hint: Suppose that 
f(x) > f(a) for some x in (a, b). Consider the function 


8(x) = f(x) — e(x —a)(b—x) 


with g(a) = g(b) = f(a). For sufficiently small ¢« > 0 we will have 
8(x) > g(a), so g will have a maximum point y in (a,b). Now use 
Problem 23(c) (the Schwarz second derivative of (x —a)(b— x) is simply 
its ordinary second derivative). 

If f is a continuous function on [a, b] whose Schwarz second derivative 
is 0 at all points of (a, b), then f is linear. 


Let f(x) = x*sin 1/x? for x #0, and f(0) = 0. Show that f =0 up to 
order 2 at 0, even though f”(0) does not exist. 


This example is slightly more complex, but also slightly more impressive, 
than the example in the text, because both f'(a) and f’(a) exist for 
a #0. Thus, for each number a there is another number m(a) such that 


m(a) 


2 


(*) f@) = f(a)+ f(a —a)+ (x — a)? + R,(x), 


0; 


: Ra(x) 
were hine—sas ee 
xa (x — a) 
namely, m(a) = f”(a) fora # 0, and m(0) = 0. Notice that the function 
m defined in this way is not continuous. 
Suppose that f is a differentiable function such that (*) holds for all a, 
with m(a) = 0. Use Problem 27 to show that f’(a) = m(a) = 0 for 
all a. 
Now suppose that (*) holds for all a, and that m is continuous. Prove 
that for all a the second derivative f(a) exists and equals m(a). 


*CHAPTER € IS TRANSCENDENTAL 


The irrationality of e was so easy to prove that in this optional chapter we will 
attempt a more difficult feat, and prove that the number e is not merely irrational, 
but actually much worse. Just how a number might be even worse than irrational 
is suggested by a shght rewording of defmitions. A number x is irrational if it is 
not possible to write x = a/b for any integers a and b, with b 4 0. This is the 
same as saying that x docs not satisfy any equation 
Oia — a) 

for integers a and b, except for a = 0, b = 0. Viewed in this light, the irrationality 
of V2 does not seem to be such a terrible deficiency; rather, it appears that V2 just 
barely manages to be irrational—although V2 is not the solution of an equation 


ayx +ag = 0, 


it ws the solution of the equation 


of one higher degree. Problem 2-18 shows how to produce many irrational num- 
bers x which satisfy higher-degree equations 


i” ki, x el ao 


where the a; are integers not all 0. A number which satisfies an “algebraic” cqua- 
tion of this sort is called an algebraic number, and practically every number we 
have ever encountered is defined in terms of solutions of algebraic equations (7 
and e are the great exceptions in our limited mathematical experience). All roots, 


V2, BolT, 


are clearly algebraic numbers, and even complicated combinations, like 


134/54 ¢14vi+ V6 


are algebraic (although we will not try to prove this). Numbers which cannot be 


such as 


obtained by the process of solving algebraic equations are called transcendental; 
the main result of this chapter states that e is a number of this anomalous sort. 
The proof that ¢ is transcendental is well within our grasp, and was theoretically 
possible even before Chapter 20. Nevertheless, with the inclusion of this proof, we 
can justifiably classify ourselves as something more than novices in the study of 
higher mathematics; while many irrationality proofs depend only on elenientary 
properties of numbers, the proof that a number is transcendental usually involves 
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some really high-powered mathematics. Even the dates connected with the tran- 
scendence of e are nnpressively recent—the first proof that e is transcendental, 
due to Hermite, dates from 1873. The proof that we will give is a simplification, 
due to Hilbert. 

Before tackling the proof itself, it is a good idea to map out the strategy, which 
depends on an idea used even in the proof that e is irrational. ‘Two features of the 


expression 
| | | eB 
a i) i n 
were important for the proof that e is irrational: On the one hand, the number 
| 
Ite t— 
|! n! 


can be written as a fraction p/q with q <n! (so that n!(p/q) 1s an mteger); on the 
other hand, 0 < R, < 3/(n + 1)! (so n!R, is not an integer). ‘These two facts show 
that e can be approximated particularly well by rational numbers. Of course, every 
number x can be approximated arbitrarily closely by rational numbers—if e > 0 
there is a rational number r with |x —r| < ¢; the catch, however, is that it may be 
necessary to allow a very large denominator for r, as large as 1/e perhaps. For e 
we are assured that this is not the case: there is a fraction p/g within 3/(n + 1)! 
of e, whose denominator g is at most n!. If you look carefully at the proof that e 
is irrational, you will see that only this fact about e is ever used. ‘The number e is 
by no means unique in this respect: generally speaking, the betler a number can be 
approximated by rational numbers, the worse it is (some evidence for this assertion 
is presented in Problem 3). ‘The proof that e is transcendental depends on a natural 
extension of this idea: not only e, but any finite number of powers e, e7, ... , e”, 
can be simultaneously approximated especially well by rational numbers. In our 
proof we will begin by assuming that e is algebraic, so that 


(x) Ane" me ee a 0. ag # 0 


for some integers ag, ..- , Gr. In order to reach a contradiction we will than find 
certain integers M, Mj, ... , M, and certain “small” numbers €;, ... , €, such that 
| M,+¢« 
e = ——— ., 
M 
5 Woy a) 
e° = —____, 
M 
n My + €n 
ee" = ———., 
M 


Just how small the €’s must be will appear when these expressions are substituted 
into the assumed equation (*). After nultiplying through by M we obtain 


[agM +a;M,+--- + ay,M, | + [eyay GP 288s Endy, | =), 
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THEOREM 1 


PROOF 


The first term in brackets is an integer, and we will choose the M’s so that it will 
necessarily be a nonzero integer. We will also manage to find €’s so small that 


\€,ay =taiene: + €nQn| << A 


= 


this will lead to the desired contradiction—the sum of a nonzero integer and a 


number of absolute value less than 4 cannot be zero! 


As a basic strategy this is all very reasonable and quite straightforward. ‘The 
remarkable part of the proof will be the way that the M’s and e’s are defined. In 
order to read the proof you will need to know about the gamma function! (This 
function was introduced in Problem 19-40.) 


e is transcendental. 


Suppose there were integers dg, ... , dn, with ag 4 0, such that 
(#) ane” + aye" | +++» +49 = 0. 


Define numbers M, My, ... , M, and €1,... , €) as follows: 


ve | OS er 

: ( &) 

i" | (p — 1)! as 
: = * : . = o~X 

Ek =e | oa 
0 (p — 1)! 


The unspecified number p represents a prime number* which we will choose later. 
Despite the forbidding aspect of these three expressions, with a little work they will 
appear much more reasonable. We concentrate on M first. If the expression in 
brackets, 


[*— 1°. "Ge =n), 
is actually multiplied out, we obtain a polynomial 


ot ee! 


* The term “prime number” was defined in Problem 2-17. An important fact about prime numbers 
will be used in the proof, although it is not proved in this book: If p is a prime number which docs 
not divide the integer a, and which does not divide the integer b, then p also does not divide ab. 
The Suggested Reading mentions references for this theorem (which is crucial in proving that the 
factorization of an integer into primes is unique). We will also use the result of Problem 2-17(d), 
that there are infinitely many primes~~the reader is asked to determine at precisely which points this 
information is required. 
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with integer coefficients. When raised to the pth power this becomes an even 
more complicated polynomial 


xP 4 oe + (nl)? 


Thus M can be written in the form 


np | ~~ 
— ) ——C, xP —l+a9-* dx, 
(=i) Jo 
a=0 


where the Cy are certain integers, and Cp = +(n!)?. But 


>?) 
| xtetdx=ki. 
0 


Thus 
np 
(p—1+a)! 
M= C4 ——--—_.. 
> (p = a eee 0 )! 
a«=0 
Now, for a = 0 we obtain the term 
(p—1)! 
+(niyr (2 = = +(n!)?. 
(p—1)! 


We will now consider only primes p > n; then this term is an integer which is not 

divisible by p. On the other hand, if a > 0, then 
(p—1+a)! 

(p-— 1)! 
which 7s divisible by p. Therefore M itself is an integer which is no? divisible by p. 
Now consider M;,. We have 
co vp-lryy — oe Pp-x 
x (x —1)-...-(« —n)]?’e 
Mae [ST a 
k (p— 1)! 


(ie xP—"i(x — 1)-...+ (x —n)]Pe7@ 
k (p=) 


=Ca(p+a—1)\(p+a—2)-...- p, 


dx. 
This can be transformed into an expression looking very much like M by the 
substitution 


u=x—k 
aa! dx, 


The limits of integration are changed to 0 and oo, and 


M, = ‘hs (u +k N+ k= Vee pha nye" 
0 (p—1)! 


There is one very significant difference between this expression and that for M. 
The term in brackets contains the factor uw in the kth place. Thus the pth power 
contains the factor uv”. ‘This means that the entire expression 


(u +k) [atk —1)+...+(etk—n)]?P 
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is a polynomial with integer coefficients, every term of which has degree at least p. 
Thus 


np 


l _ (po 1 op! l+a)! 
Me) "——_D, P Ita, i Ds ; 
= Oa ie ‘ a as (p— DL 


where the Dy are certain integers. Notice that the summation begins with @ = 1; 
in this case every term in the sum is divisible by p. Thus each M, is an integer 
which 7s divisible by p. 

Now it is clear that 


M, 7 
a at aol ares 
M 


Substituting into (*) and multiplying by M we obtain 
[agM “lz a\M Siaae Ge ieci anM,| te [aye ap OP Se Gn€n| =) 


In addition to requiring that p > n let us also stipulate that p > fag|. This means 
that both M and ag are not divisible by p, so apM 1s also not divisible by p. Since 
each M, is divisible by p, it follows that 


agM +a,;M, +---+a,M, 


is not divisible by p. In particular it is a nonzero integer. 
In order to obtain a contradiction to the assumed equation (*), and thereby 
prove that e is transcendental, it is only necessary to show that 


layey = gaia + Gn€En| 


can be made as small as desired, by choosing p large enough; it is clearly sufficient 
to show that each |e,| can be made as small as desired. ‘This requires nothing more 
than some simple estimates; for the remainder of the argument remember that 7 
is a certain fixed number (the degree of the assumed polynomial equation (*)). To 
begin with, if | < k <n, then 


eRe : iene P| a-x 
aie éf ibe [x -—1)-...-G@—nv)]?’le he 
0 Cog aali)! 
: [ nP—'y — 1)-...+ (x — a) [Pe 
= ——— === Geese 
0 (p —1)! 
Now let A be the maximum of |(x — 1)-...- (x —”)| for x in [0,7]. Then 


| | y etnP-l AP if __ 
Galen, —— eee Cla NX 
(p— 1)! Jo 


en? VA fa 
< ——___ en ae 
(pall leg 


e’nP—l AP 
(p — 1)! 
MP AP e"(n A)? 


= @pe al)! = et 
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But 2 and A are fixed; thus (7 A)?/(p — 1)! can be made as small as desired by 
making p sufficiently large. J 


This proof, like the proof that z is irrational, deserves some philosophic af- 
terthoughts. At first sight, the argument seems quite “advanced” —after all, we 
use mtegrals, and integrals from 0 to oo at that. Actually, as many mathemati- 
clans have observed, integrals can be eliminated from the argument completely; 
the only integrals essential to the proof are of the form 


for mtegral k, and these integrals can be replaced by k! whenever they occur. 
Thus M, for example, could have been defined initially as 


np 
(eae)! 
M= Co : 
du (HN)! 


where Cy, are the coefficients of the polynomial 
ee (ee a)F 


If this idea is developed consistently, one obtains a “completely elementary” proof 
that e is transcendental, depending only on the fact that 


l l l 
aS hint a . 

Unfortunately, this “elementary” proof is harder to understand than the original 
one—the whole structure of the proof must be hidden just to eliminate a few 
integral signs! ‘This situation is by no means pecuhar to this specific theorem— 
“elementary” arguments are frequently more difficult than “advanced” ones. Our 
proof that z is irrational is a case in point. You probably remember nothing 
about this proof except that it involves quite a few complicated functions. There is 
actually a more advanced, but much more conceptual proof, which shows that z 
is franscendental, a fact which is of great historical, as well as intrinsic, interest. One 
of the classical problems of Greek mathematics was to construct, with compass 
and straightedge alone, a square whose area is that of a circle of radius 1. This 
requires the construction of a line segment whose length is /2, which can be 
accomplished if a line segment of length 7 1s constructible. The Greeks were 
totally unable to decide whether such a line segment could be constructed, and 
even the full resources of modern mathematics were unable to settle this question 
until 1882. In that year Lindemann proved that z 1s transcendental; since the 
length of any segment that can be constructed with straightedge and compass can 
be written in terms of +, -, —, +, and / , and is therefore algebraic, this proves 
that a line segment of length z cannot be constructed. 

The proof that a is transcendental requires a sizable amount of mathematics 
which is too advanced to be reached in this book. Nevertheless, the proof is not 
much more difheult than the proof that e is transcendental. In fact, the proof 
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for 7 is practically the same as the proof for e. This last statement should certainly 
surprise you. ‘The proof that e is transcendental seems to depend so thoroughly 
on particular properties of e that it is almost inconceivable how any modifications 
could ever be used for 2; after all, what does e have to do with 2? Just wait and 
see! 


PROBLEMS 


1. (a) Prove that if « > 0 is algebraic, then /a@ is algebraic. 
(b) Prove that if @ is algebraic and r is rational, then @ + r and ar are 
algebraic. 


Part (b) can actually be strengthened considerably: the sum, product, 
and quotient of algebraic numbers is algebraic. ‘This fact 1s too difficult 
for us to prove here, but some special cases can be examined: 


2. Prove that V2 + V3 and V2(1 SP V3) are algebraic, by actually finding 


algebraic equations which they satisfy. (You will need equations of degree 4.) 


*3. (a) Let a be an algebraic number which is not rational. Suppose that @ 
satisfies the polynomial equation 


(GH an Fal a — 0 


and that no polynomial function of lower degree has this property. Show 
that f(p/q) # O for any rational number p/q. Hint: Use Prob- 
lem 3-7(b). 

(b) Now show that | f(p/q)| = 1/q” for all rational numbers p/q with q > 0. 
Hint: Write f(p/q) as a fraction over the common denominator q”. 

(c) Létpwegaagstipl | f'(x)\ |x'=ea| < 1}. Wsethe Mean- ValnerTleorem 
to prove that if p/q is a rational number with ja — p/g| < 1, then 
la — p/q| > 1/Mq”. (It follows that for c = min(1,1/M) we have 
la — p/q| > c/q” for all rational p/q.) 


*4. Let 
a = 0).1 1000 LOODDDOO00000000000 L000... , 


where the 1’s occur in the 7! place, for each n. Use Problem 3 to prove that 
@ is transcendental. (For each n, show that @ is not the root of an equation 
of degree n.) 


Although Problem 4 mentions only one specific transcendental number, it should 
be clear that one can easily construct infinitely many other numbers @ which do 
not satisfy ja — p/q| > e/g" for any ¢ and n. Such numbers were first considered 
by Liouville (1809-1882), and the inequality in Problem 3 is often called Liouville’s 
inequality. None of the transcendental numbers constructed in this way happens to 
be particularly interesting, but for a long time Liouville’s transcendental numbers 
were the only ones known. ‘Vhis situation was changed quite radically by the work 
of Cantor (1845-1918), who showed, without exhibitmg a single transcendental 
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number, that most numbers are transcendental. ‘The next two problems provide an 
introduction to the ideas that allow us to make sense of such statements. ‘The basic 
definition with which we must work is the following: A set A is called countable 
if its elements can be arranged in a sequence 


Gig BD, AZ yGAgnn 


The obvious example (in fact, more or less the Platonic ideal of) a countable set 
is N, the set of natural numbers; clearly the set of even natural numbers is also 
countable: 


2:40; Oe 


It is a httle more surprising to learn that Z, the set of all integers (positive, negative 
and Q) is also countable, but seeing is believing: 


0,1, —I, 2, —2, 3, —3,. 


The next two problems, which outline the basic features of countable sets, are 
really a series of examples to show that (1) a lot more sets are countable than one 
might think and (2) nevertheless, some sets are not countable. 


*5. (a) Show that if A and B are countable, then so is AUB ={x:x isin A or 
xis B}). Hmt: Use the:same trick that worked for Z. 

(b) Show that the set of positive rational numbers is countable. (This is really 

quite startling, but the figure below indicates the path to enlightenment.) 


i= 
Ma 


ty 


ws ae 


2 2 2 2 2 
1 3 4 5 
3 3 3 3 3 
1 2 3 4 5 


(c) Show that the set of all pairs (m,n) of integers ts countable. (This is 
practically the same as part (b).) 
(d) If Ay, Az, A3,... are each countable, prove that 


Aj U AoW Agia 


is also countable. (Again use the same trick as m part (b).) 

(ec) Prove that the set of all triples (/, m,n) of integers is countable. (A triple 
(1, m,n) can be described by a pair (/,m) and a number n.) 

(f) Prove that the set ofall n-tuples (a). a2, .... dn) 1s countable. (If you have 
done part (e), you can do this, usmg induction.) 

(g) Prove that the set of all roots of polynomial functions of degree n with 
integer coefficients 1s countable. (Part (f) shows that the set of all these 
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*6. 


polynomial functions can be arranged in a sequence, and each has at 
most 7 roots.) 
(h) Now use parts (d) and (g) to prove that the set of all algebraic numbers 


is countable. 


Since so many sets turn out to be countable, it is important to note that the 
set of all real numbers between O and | is not countable. In other words, 
there is no way of listing all these real numbers in a sequence 


Gi = Ola ai2aisaine. . 
a2 = 0.a21a22a23a24... 
a3 = 0.431432433034... 


(decimal notation 1s being used on the right). ‘lo prove that this 1s so, suppose 
such a list were possible and consider the decimal 


0.41 1422033444..., 


whore Gym 01! Gineeeennd G,, = Out Gq, —s. Show that this number 
cannot possibly be in the hist, thus obtammg a contradiction. 


Problems 5 and 6 can be summed up as follows. ‘The set of algebraic numbers 
is countable. If the set of transcendental numbers were also countable, then the 


set of all real numbers would be countable, by Problem 5(a), and consequently the 
set of real numbers between 0 and | would be countable. But this is false. Thus, 
the set of algebraic numbers is countable and the set of transcendental numbers 
is not (“there are more transcendental numbers than algebraic numbers”). ‘The 


remaining two problems illustrate further how important it can be to distinguish 


between sets which are countable and sets which are not. 


“He 


Let f be a nondecreasing function on [0,1]. Recall (Problem 8-8) that 

lin f(x) and lim f(x) both exist. 

i ea a 

(a) For any ¢ > O prove that there are only finitely many numbers a in 
(0, 1] with lim f(x)— lm f(x) > e. Hint: There are, in fact, at most 
t= 


ie ath | : Cat 2] 
[ fC) — f(O)]|/e of them. 
(b) Prove that the sect of pomts at which f 1s discontinuous is countable. 
Hint: If lim f(x) — lim f(x) > 0, then it is > 1/n for some natural 


Boer ar Bs geeog 


number n. 


This problem shows that a nondecreasing function is automatically con- 
tinuous at most pomts. For differentiability the situation is more difficult 
to analyze and also more interesting. A nondecreasing function can fail 
to be differentiable at a sct of pomts which is not countable, but it is still 
true that nondecreasing functions are differentiable at most points (in a 
different sense of the word “most”). Reference [38] of the Suggested 
Reading eives a proof using the Rising Sun Lemma of Problem 8-20. 


*8. 


(a) 
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For those who have done Problem 9 of the Appendix to Chapter 11, it 
is possible to provide at least one application to differentiability of the 
ideas already developed in this problem set: If f is convex, then f is 
differentiable except at those points where its right-hand derivative f,’ 
is discontinuous; but the function f,’ is increasing, so a convex function 
is automatically differentiable except at a countable set of points. 


Problem 11-70 showed that if every point is a local maximum point for 
a continuous function f, then f is a constant function. Suppose now that 
the hypothesis of continuity is dropped. Prove that f takes on only a 
countable set of values. Hint: For each x choose rational numbers a, 
and b, such that a, < x < b, and x is a maximum point for f on 
(a,,b,). Then every value f(x) is the maximum value of f on some 
interval (a,, b,). How many such intervals are there? 


(b) Deduce Problem 11-70(a) as a corollary. 


(c) 


Prove the result of Problem 11-70(b) similarly. 


CHAPTER INFINITE SEQUENCES 


The idea of an infinite sequence is so natural a concept that it is tempting to 
dispense with a definition altogether. One frequently writes simply “an infinite 
sequence 

“MCDA ely Veo ot 


the three dots indicating that the numbers a; continue to the right “forever.” A 
rigorous definition of an infinite sequence is not hard to formulate, however; the 
unportant point about an infinite sequence is that for each natural number, 7, 
there is a real number a,. ‘Vhis sort of correspondence is precisely what functions 
are meant to formalize. 


DEFINITION An infinite sequence of real numbers is a function whose domain is N. 


From the pomt of view of this definition, a sequence should be designated by a 
single letter like a, and particular values by 


a(n ak) a). 


but the subscript notation 
a5 GD. G2. - 


is almost always used instead, and the sequence itself is usually denoted by a symbol 
like {a,}. Thus {7}, {(—1)"}, and {1/n} denote the sequences a, B, and y defined 


by 
An =H, 
Br = ( i 
l 
Yn=t- 
i 


A sequence, like any function, can be graphed (Figure 1) but the graph is usually 


rather unrevealing, since most of the function cannot be fit on the page. 


Be {Bn} {Yn} 


wile Reese, mers a fe Ceoonnnnee 


(a) (b) (c) 


DEFINITION 
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-=Ps=f=6 O fo =fs=fo= 


Y4 2 
SS 
0 ys 3 v1 


FIGURE 2 


A more convenient representation of a sequence is obtained by simply labeling 
the points aj, a2, a3, ... on a line (Figure 2). This sort of picture shows where 
the sequence “is going.” The sequence {@,} “goes out to infinity,” the sequence 
{Bn} “jumps back and forth between —1 and 1,” and the sequence {y,} “converges 
to 0.” Of the three phrases in quotation marks, the last 1s the crucial concept 
associated with sequences, and will be defined precisely (the definition is illustrated 
in Figure 3). 


ee : : ip “ an4s ay43 ay41 He a3 a2 ay 


FIGURE 3 


A sequence {a,} converges to / (in symbols lim a, = /) if for every ¢ > 0 there 


is a natural number N such that, for all natural numbers n, 


if n > N, then ja, —l| <e. 


In addition to the terminology introduced in this definition, we sometimes say 
that the sequence {a,} approaches / or has the limit J. A sequence {ap} 1s said 
to converge if it converges to / for some /, and to diverge if it does not converge. 

To show that the sequence {y,} converges to 0, it suffices to observe the following, 
If ¢ > O, there is a natural number N such that 1/N < e. Then, if n > N we 
have 


Yo = pe sO |Yn — O| < «€. 


The limit 


lim A eigen 


will probably seem reasonable after a little reflection (it just says that Yn + I 1s 


practically the same as Vn for large n), but a mathematical proof might not be so 
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obvious. ‘To estimate /n + 1 — Jn we can use an algebraic trick: 


Jeti fa = Ot l=vaolntl t+ yn) 
vntl+vn 


n+l—n” 


Ls =Sre" 
| 
Sore Ee 


It is also possible to estimate Yn + 1 — Vn by applymg the Mean Value Theorem 
to the function f(x) = /x on the interval [n,n + 1]. We obtain 


vn+il—vyvn +1 = he 


= f'(x) 


= . forsome x in (n,n + 1) 


2/n- 


Either of these estimates may be used to prove the above limit; the detailed proof 
is left to you, as a simple but valuable exercise. 
The limit 
3n3+7n?+1 3 
lim ——————}— = = 
noo 4n3—8n+63 4 
should also seem reasonable, because the terms involving n° are the most impor- 
tant when v7 is large. If you remember the proof of Theorem 7-9 you will be able 
to guess the trick that translates this idea into a proof—dividing top and bottom 
by n? yields 


7 | 

: “ 34+-4+ =) 
3n? + 7n- + 1 no 
4n> — 8n + 63 8 63 
4--—4+-— 

n- on 


Using this expression, the proof of the above limit is not difficult, especially if one 
uses the following facts: 


If lim a, and lim b,, both exist, then 
n> oO nu-—>Co 


lim (a, + b,) = lim a, + lim by, 
noo n— OO n—- O&O 


lim (a, -b,) = lim a,- lim by; 
n> OO nN OO n> OO 


moreover, Sim b, #0, then b, 4 0 for all n greater than some N, and 


i- 


lim d,/b, = im. An/ him by. 
no 
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(If we wanted to be utterly precise, the third statement would have to be even 
more complicated. As it stands, we are considering the limit of the sequence 
{cn} = {a,/bn}, where the numbers c, might not even be defined for certain n < N. 
This doesn’t really matter—we could define c, any way we liked for such n— 
because the limit of a sequence is not changed if we change the sequence at a 
finite number of points.) 

Although these facts are very useful, we will not bother stating them as a 
theoren1— you should have no difficulty proving these results for yourself, because 
the definition of lim a, = / is so similar to previous definitions of limits, especially 


N—> CO 
lina fed. 
X— OO 
The similarity between the definitions of lim a, = / and lim f(x) = / is 
nw XO 
actually closer than mere analogy; it is possible to define the first in terms of the 


second. If f is the function whose graph (Figure 4) consists of line segments joining 


FIGURE 4 
the pomts in the graph of the sequence {a,}, so that 


Y Cee Ge 1 — Gn) (% —7t) a, nox = ieee 


then 


lint a, —* ifandeonly i hm f@)—y- 
NO XS CO 


Conversely, it fesatisfies lm: f(x) = 7, and we set a, =o(), then lim ay =¥- 
xXxOoO X—- 00 
This second observation is frequently very useful. For example, suppose that 
OF @—"b. "Then 


bia eae MG 
Nn OO 


To prove this we note that 


lint @ = lie: ee — 10) 
xXx— OO xX- ow 


since loga < 0, so that x loga is a negative and large in absolute value for large x. 


Notice that we actually have 


linieae Oe it ace ale 
hi 


for if a < 0 we can write 


borvia” = thin (1) lal? =0: 
UWF GO N— MO 
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THEOREM 1 


PROOF 


The behavior of the logarithm function also shows that if a > 1, then a” be- 
comes arbitrarily large as n becomes large. ‘This assertion is often written 


hm a" =o, aril, 


n> CoO 


and it is sometimes even said that {a"} approaches oo. We also write equations 
like 


lim —a” = —oo, 
n— OO 


and say that {—a”} approaches —oo. Notice, however, that if a < —1, then in ae 
does not exist, even in this extended sense. 

Despite this connection with a familiar concept, it is more important to visualize 
convergence in terms of the picture of a sequence as pomts on a line (Figure 3). 
There is another connection between limits of functions and limits of sequences 
which 1s related to ¢his picture. ‘This connection is somewhat less obvious, but con- 
siderably more interesting, than the one previously mentioned— instead of defining 
limits of sequences in terms of limits of functions, it is possible to reverse the pro- 
cedure. 


Let f be a function defined in an open interval containing c, except perhaps at c 
itself, with 


hie) — a 
i? Ge 
Suppose that {a,} is a sequence such that 


(1) each a, is in the domain of f, 
(2) weach-ag +c, 


(3) hm a 


Then the sequence { f(a,,)} satisfies 
lim f (@a) =1. 


Conversely, if this is true for every sequence {ay} satisfying the above conditions, 
then lim (x) 78 
ia Ge 


Suppose first that lim j Qs) = Then for every ¢ > Outhere is ad > 0 such thar 
forall “ 
if 0 < |x —c| <6, then |f(x) —1| < ¢. 
If the sequence {a,} satisfies im, a, = c, then (Figure 3) there is a natural num- 
ber N such that, 
ifn > N, then ja, —e| <6. 


By our choice of 6, this means that 


| f (an) oa l| << 135 


a} 


AGU IRIS, 3 


THEOREM 2 


PROOF 
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showing that 
lin Ga, )i— & 


1 >iXD 


Suppose, conversely, that lim f(a,) =/ for every sequence {a,} with lim a, = 
ii —> OS n> OO 


c. If lim f(x) = / were not true, there would be some ¢ > 0 such that for every 


AS aca 


6 > 0 there is an x with 
O<|x—cl| <6 but |f(x)—-l| >. 


In particular, for each n there would be a number x, such that 
l 
O<|x,-—cl<—- but |f(x,)—-l| > e. 
n 


Now the sequence {x,} clearly converges to c but, smce | f(x,) —/| > € for all n, 
the sequence {f(x,)} does not converge to /. This contradicts the hypothesis, so 
lim f Ge) = Must betrue. J 


b Gm ote 


Theorem | provides many examples of convergent sequences. For example, the 
sequences {a,} and {b,} defined by 


] 
so (13 + =) 
ne 
; ] 
BD, 2608 (sin (1 +(—1)"- -)) : 
n 


clearly converge to sin(13) and cos(sin(1)), respectively. It is important, however, 
to have some criteria guaranteeing convergence of sequences which are not obvi- 
ously of this sort. ‘There 1s one important criterion which is very easy to prove, but 
which is the basis for all other results. ‘This criterion is stated in terms of concepts 
defined for functions, which therefore apply also to sequences: a sequence {a,} is 
increasing if a,,,; > ad, for all n, nondecreasing if a,,; > a, for all n, and 
bounded above if there is a number M such that a, < M for all n; there are sim- 
ilar definitions for sequences which are decreasing, nonincreasing, and bounded 
below. 


If {a,} 1s nondecreasing and bounded above, then {a,} converges (a similar state- 
ment is true if {a,} 1s nonincreasing and bounded below). 


The set A consisting of all numbers a, 1s, by assumption, bounded above, so A has 


a least upper bound a. We claim that lim a, = @ (Figure 5). In fact, if e > 0, 
nO 


there is some ay satisfying @ — ay < &, since @ is the least upper bound of A. 
Then if n > N we have 


Gy, ian. SO 0 —Ga= 0 —@y =< &. 


This proves that lm a, = a. f 
li Oo 


458 Infinite Sequences and Infinite Series 


The hypothesis that {a,,} 1s bounded above is clearly essential in Theorem 2: if 
{an} 1s not bounded above, then (whether or not {a@,} 1s nondecreasing) {a,} clearly 
diverges. Upon first consideration, it might appear that there should be little 
trouble deciding whether or not a given nondecreasing sequence {a,} is bounded 
above, and consequently whether or not {a,} converges. In the next chapter such 
sequences will arise very naturally and, as we shall see, deciding whether or not 
they converge is hardly a trivial matter. For the present, you might try to decide 
whether or not the following (obviously increasing) sequence is bounded above: 

hoi+3. 14444. 1454444... 

Although Theorem 2 treats only a very special class of sequences, it is more 
useful than might appear at first, because it is always possible to extract from an 
arbitrary sequence {a,} another sequence which is either nonincreasing or else 
nondecreasing. ‘To be precise, let us define a subsequence of the sequence {a,} 
to be a sequence of the form 

graph of a 
Ging hy 
where the 2; are natural numbers with 


2 and 6 are peak points daly Sed SAB > 


t—t~ Then every sequence contains a subsequence which is either nondecreasing or 
nonincreasing. It is possible to become quite befuddled trying to prove this as- 


sertion, although the proof is very short if you think of the right idea; it is worth 


FIGURE 6 recording as a lemma. 
LEMMA Any sequence {a,} contains a subsequence which is either nondecreasing or non- 
increasing, 
PROOF Call a natural number 2 a “peak point” of the sequence {an} if Gm < dy for all 


m > n (Figure 6). 


Case 1. The sequence has infinitely many peak points. In this case, if ny < ng < 
nee< ces are the peak points, then a4, > @,, > @,, > >" -,/sO0 (@q,) 1s the desired 
(nonincreasing) subsequence. 


Case 2. The sequence has only finitely many peak points. In this case, let ny be greater 
than all peak pomts. Since 71 is not a peak point, there is some #2 > 71 such that 
Gny > Un,. Since nz is not a peak pot (it is greater than 2;, and hence greater 
than all peak poimts) there is some 73 > 22 such that d,, > dy,. Continuing in this 
way we obtain the desired (nondecreasing) subsequence. 


If we assume that our original sequence {a,} 1s bounded, we can pick up an 
extra corollary along the way. 


COROLLARY (THE Every bounded sequence has a convergent subsequence. 
BOLZANO-WEIERSTRASS THEOREM) 


DEFINITION 


THEOREM 3 


PROOF 
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Without some additional assumptions this is as far as we can go: it Is easy to 
construct sequences having many, evenly infinitely many, subsequences converg- 
ing to different numbers (see Problem 3). ‘There is a reasonable assumption to 
add, which yields a necessary and sufficient condition for convergence of any se- 
quence. Although this condition will not be crucial for our work, it does simplify 
many proofs. Moreover, this condition plays a fundamental role in more advanced 
investigations, and for this reason alone it is worth statmg now. 

Ifa sequence converges, so that the individual terms are eventually all close to 
the same number, theu the difference of any two such individual terms should be 
very small. To be precise, if lim a, = / for some /, then for any ¢ > O there is 


Lime al GS 
an N such that la, —/| < ¢/2 forn > N; now if both n > N and m > N, then 


ie: ‘Ss 
lan — ar <a lan = | ai |/ = G,| < > a > = Ee 


This final equality, ja, — a@| < €, which eliminates mention of the limit /, can 
be used to formulate a condition (the Cauchy condition) which is clearly necessary 
for convergence of a sequence. 


A sequence {a,} is a Gauchy sequence if for every ¢ > 0 there is a natural 
number N such that, for all m and n, 


ifi,n > N, then la, —am| <e. 


(This condition is usually written lim [ay — a,| = 0.) 
NCO 


The beauty of the Cauchy condition is that it is also sufficient to ensure conver- 
gence of a sequence. After all our preliminary work, there is very little left to do 
in order to prove this. 


A sequence {a,} converges if and only if it is a Cauchy sequence. 


We have already shown that {a,} 1s a Gauchy sequence if it converges. ‘The proof of 
the converse assertion contains only one tricky feature: showing that every Cauchy 
sequence {a,} 1s bounded. Ifwe take ¢ = | in the defmition of a Cauchy sequence 
we find that there is some N such that 


lin — Gh| <= | form,n > N. 
In particular, this means that 
lam — Gna1| < | for all im > N. 


Thus {a :m > N} is bounded; since there are only finitely many other a;’s the 
whole sequence is bounded. 


460 Infinite Sequences and Infinite Series 


The corollary to the Lemma thus implies that some subsequence of {a,} con- 
verges. 


Only one point remains, whose proof will be left to you: if a subsequence of a 
Cauchy sequence converges, then the Cauchy sequence itself converges. 


PROBLEMS 


1. 


im 


Verify each of the following limits. 


(1) 
(ii) 


(ili) 


*(x) 


Find 


n 


— 


lim 
noo n + 


; 4 j . 
lim Vn2+1— vn + 1=0. Hint: You should at least be able to prove 


Nn xX 


that lim Vn2+1— Vn2=0. 


n> OO 
we! 
lim — 
noo nn” 


lim 
n—- oO 


lim 
n—->%O 


Vn2-+n= 1. 
Va" +b" = max(a,b), 


lim 
n> 


lim 
n—>0o 


a,b>0. 


) : ee ae 
1 —— = 0, where @(n) is the number of primes which divide n. 
n>0oo Nn 


Hint: The fact that each prime is > 2 gives a very simple estimate of 
how small a(n) must be. 
n 
) kP 
at. | 
p+l1 


lim 
noo mPtl 


the following limits. 


f n n+] 
lim _ : 


n>oon+ | n 


lim n—J/n+aVn+h. 


n+ X 
Qn oh (— l y" 
im Ont (= | yr+l . 


(—1)"J/n sin(n") 


lim 

n> oo n+] 
q" — b” 

lim ——— 


n>co qh + pr : 


lim ne", 
n—> oC 


le) <2; 


(vn) 


(a) 
(b) 


(c) 
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qr 

hine —. 

n>oo 7! 
What can be said about the sequence {a,} if it converges and each dp is 
an integer? 
Find! all convergent subsequences of the sequence 1, —1, 1, =1, 1,=1, 
.... (There are infinitely many, although there are only two limits which 
such subsequences can have.) 
Find all convergent subsequences of the sequence I, 2, 1, 2, 3, 1, 2, 
3, 4, 1, 2, 3, 4, 5, .... (There are infinitely many limits which such 
subsequences can have.) 
Consider the sequence 


For which numbers @ is there a subsequence converging to a@? 


Prove that if a subsequence of a Cauchy sequence converges, then so 
does the original Cauchy sequence. 
Prove that any subsequence of a convergent sequence converges. 


Prove thatwil "0 <a = 2ethema < V2a@-< 2. 
Prove that the sequence 


2 2x2 Noy 2 
COnNVerges. 


Find the limit. Hint: Notice that if lnm a, =/, then lim /2a, = v2, 


[fi ee) n— OO 
by Theorem 1. 


Let O < a; < b, and define 


(a) 
(b) 


ayn +b, 
Gheel| Say QDs Brat = a aol 


-_ 


Prove that the sequences {a,} and {b,} each converge. 
Prove that they have the same limit. 


In Problem 2-16 we saw that any rational approximation k// to V2 can be 
replaced by a better approximation (k + 2/)/(A + 1). In particular, starting 
with * =/ =", we obtain 


(a 


Prove that this sequence is given recursively by 


| 
7a nt) = 1+ ——. 
ay An+1 pene 


462 Infinite Sequences and Infinite Serves 


10. 


(b) Prove that lim a, = V2. This gives the so-called continued fraction 
Nn OO 
expansion 
l 
alee 
Nps 
Dk 3 


Hint: Consider separately the subsequences {a2,} and {d2+4}. 
(c) Prove that for any natural numbers a and b, 


- >= b 
at +b=a 4+ ———————— 


b 
ae 
2 C Pont eat 


Identify the function f(x) = lim (him (cosn!mx)*). (It has been mentioned 
hom K-00 


many times m this book.) 


Many impressive looking lhmits can be evaluated easily (especially by the 
person who makes them up), because they are really lower or upper sums mn 
disguise. With this remark as hint, evaluate each of the following. (Warn- 
ing: the list contains one red herring which can be evaluated by clementary 
considerations.) 


Yer Ver4.-.4 Yor 


Q) lim 

n> oO i 
r Ve+ Ver+---+ Ver 
(1) = lim 

Nw nl 


l ] l 
Oy) fe ( n- a (n+ 1) ij = =a) 
ai n A n cud n ) 
- iil LL oe owe —_—_—_——— ie 
( a. (a+1)? (n +2) (n +n)? 


wi) 1 n " n ee n ) 
y fs al en tS eee a eral 
Ue aes fee ere + 22 n? +n? 


Although limits like lim %/n and lim a” can be evaluated using facts about 
: n> W— 00 


the behavior of the logarithm and exponential functions, this approach is 
vagucly dissatisfying, because integral roots and powers can be defined with- 
out using the exponential function. Some of the standard “elementary” ar- 
guments for such limits are outlined here; the basic tools are mequalities 
derived from the binomial theorem, notably 


(+h) > 1+nh, for h > 0; 
and, for part (c), 


— | 5) =a 2 1 
eee > ia forh > 0. 


(l+h)" >l+nh+ 


11. 


| ae 


(b) 
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Prové that lim a® = conf @> I, by.settiuig a= kta, where-> 0. 
hi Oo 


Prove that lim a” =0if 0 <a < I. 
nm OOo 


Prove that lim Ya = 1 if a > 1, bysetting Ya = 1+h and estimating h. 


n> OO 


Prove that lim */a = 1 if 0 <a <1. 


N+ % 


Prove that lim ¢%/n = 1. 


hc 


Prove that a convergent sequence 1s always bounded. 
Suppose that lim a, = 0, and that some a, > 0. Prove that the set of 


n> Co 


all numbers a, actually has a maximum member. 


Prove that 


I 
< log(n + 1) —logn < -. 
a n 


If 
| re | ] 
Net ch eet = = ORM, 


show that the sequence {a,} is decreasmg, and that each a, > 0. It 
follows that there is a number 


| 
y = lim (142-42 tog), 
n ; 


n—>OO 


‘This number, known as Euler’s number, has proved to be quite refractory; 
it is not even known whether y is rational. 


Suppose that f is increasing on [1, 00). Show that 


ey (ea) =| FOG =f 2 Ee): 
I 


Now choose f = log and show that 
oS 


an (rteibya 
<n! < ———_; 
en-l en 
it follows that 
; Vn! 1 
lm = 
n>o Hn e 


This result shows that Vn! is approximately n/e, m the sense that the 
ratio of these two quantities is close to | for large n. But we cannot 
7m this sense; mm fact, this is false. An 
estimate for n!is very desirable, even for concrete computations, because 
n! cannot be calculated easily even with logarithm tables. The standard 
(and difficult) theorem which provides the right information will be found 


in Problem 27-19. 


conclude that n! is close to (1/e) 
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14. + (a) 


FG RSE: 


FIGURE 8 


(d) 
() 


PUG Ras 


Show that the tangent line to the graph of f at (x1, f(41)) intersects the 
horizontal axis at (x2, 0), where 
f(x) 
F'(x1) 

This intersection point may be regarded as a rough approximation to 
the pomt where the graph of f intersects the horizontal axis. If we now 


start at x2 and repeat the process to get x3, then use x3 to get x4, etc., 
we have a sequence {x,} defined inductively by 


(ee) 

f'n) 
Figure 7 suggests that {x,} will converge to a number c with f(c) = 0; 
this is called Nezwton’s method for finding a zero of f. In the remainder 


r= Gl 


Xn+l = Xn 


of this problem we will establish some conditions under which Newton’s 
method works (Figures 8 and 9 show two cases where it doesn’t). A few 
facts about convexity may be found useful; see Chapter 11, Appendix. 
Suppose that f,’ f” > 0, and that we choose x; with f(x) > 0. Show 
thie xo > x7 Sc. 

Let 6, = x, —c. Then 


_ Fx) 
 f(E) 

for some & in (c, x,). Show that 

Fx) — fw) 
Ge: 7 Cer) 


k 


rl 
Conclude that 
i Ge 
eA cic ee on 
I Gor oe 


for some nx mm (c, xg), and then that 


f'n) (re — &) 


i Go 
Opa] < ——6,-. 
(*) kl S Ta k 


Lét.m—Seer—ant f'.on [c, {Mand let VY — sup fon [gail show 
that Newton’s method works if x, — c < m/M. 
What is the formula for x,,) when f(x) =x? — A? 
If we take A = 2 and x; = 1.4 we get 
y= 1.4 
x2 = 1.4142857 
x3 = 1.4142136 
x4 = 1.4142136, 


which is already correct to 7 decimals! Notice that the number of correct 
decinals at least doubled cach time. ‘Phis is essentially guaranteed by the 
inequality (*) when M/m < |. 


FIGURE 10 


15. 


*16, 


i 


*18. 


no; 


20. 


2a 
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Use Newton’s method to estimate the zeros of the following functions. 


(i) f(x) =tanx —cos* x near 0. 
iE 6) = ses es near 0. 
(i) eo ae ee | on | O. le 
(iv) f(x) =x? —3x*41 om (0, 1]: 


Prove that if lim @ =. then 


n— CO 


; a| + oA + Gi 
hm — = —— = i 
N00 n 


Hint: This problem is very similar to (in fact, it is actually a special case of) 
Problem 13-40. 


(a) Prove that if im Gn41 — Qn = 1, then lim a,/n = 71. Hint: See the 


nO 


previous arobiiara. 
(b) Suppose that fois cenunuous and lim f(r!) == ( (x) See Prove 


XO 


that tim fC. =) Eimegyeray ands), be thetinf “andssuprol | on 
[7,7 as TH. 


Suppose that a, > 0 for each vn and that lim a,4)/a, = /. Prove that 
NCO 


hm </a, =/. Hint: This requires the same sort of argument that works in 
Nn O& 


Problem 16, except using multiplication instead of addition, together with 
the fact that lim */a = 1, fora > 0. 


n— OO 


(a) Suppose that {a,} is a convergent sequence of points all in [0, 1]. Prove 


that lm a, is also in [0, 1]. 
HN 8 


(b) Find a convergent sequence {a,} of points all in (O, 1) such that lim a, 
noo 


is mot 11 (0. 1): 


Suppose that f is continuous and that the sequence 


PE LG Wa fel Gl Oe 


converges tod. Prove that dis a “fixed point) for f,1.e., f@) =/1. Hint: Two 
special cases have occurred already. 


(a) Suppose that f is continuous on [0, 1] and that 0 < f(x) < 1 forallx in 
{O. 1]. Problem 7-11 shows that f has a fixed pomt (in the terminology 
of Problem 20). If f is increasing, a much stronger statement can be made: 
For any x in [0, 1], the sequence 


Me AS) fF GO) nee 


has a limit (which is necessarily a fixed point, by Problem 20). Prove this 
assertion, by examining the behavior of the sequence for f(x) > x and 
f(x) < x, or by looking at Figure 10. A diagram of this sort is used 
in Littlewood’s Mathematician’s Mhscellany to preach the value of drawing 
pictures: “For the professional the only proof needed is [this Figure].” 
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*(b) Suppose that f and g are two continuous functions on [0, 1], with 0 < 


f(x) <1 and 0 < g(x), < 1 for all x in [0,1], which satisfy fog = 
go f. Suppose, moreover, that f is increasing. Show that f and g have 
a common fixed point; in other words, there is a number / such that 
f(@) =! = g(1). Hint: Begin by choosing a fixed point for g. 


For a long time mathematicians amused themselves by asking whether the 


conclusion of part (b) holds without the assumption that f is increasing, but 


two independent announcements in the Votes of the American Mathemati- 


cal Society, Volume 14, Number 2 give counterexamples, so it was probably 
a pretty silly problem all along. 


The trick m Problem 20 is really much more valuable than Problem 20 might 


suggest, and some of the most important “fixed pomt theorems” depend upon 
looking at sequences of the form x, f(x), f(f(x)), .... A special, but representa- 
tive, case of one such theorem is treated in Problem 23 (for which the next problem 
is preparation). 


22. 


Tags 


agin 


255 


(a) 


Use Problem 2-5 to show that if c 4 1, then 


cM — cht Il 


cr ia cnt Be hye ct 


l-—c 


(b) Suppose that |c| < 1. Prove that 


(c) 


im c?-++-4 cc" =0. 


mA Oo 


Suppose that {x,} is a sequence with |x, — x,41| < c’, where 0 <c <1. 
Prove that {x,} is a Cauchy sequence. 


Suppose that f is a function on R such that 


(*) lf(x) — f(y)| < clx - yl. for all x and y, 


where c < 1. (Such a function is called a contraction.) 


(a) 


Prove that f is continuous. 
Prove that f has at most one fixed pomt. 
By considering the sequence 


is SO) f oi Cee 


for any x, prove that f does have a fixed point. (This result, 1a more 
general setting, 1s know1 as the “contraction lemma.”) 


Prove that if f is differentiable and | f’| < 1, then f has at most one 
fixed poimt. 

Prove that if | f’(x)| < ¢ < | for all x, then f has a fixed point. 

Give an example to show that the hypothesis | f’(x)| < 1 is not sufficient 
to ensure that f has a fixed pout. 


Vhis problem is a sort of converse to the previous problem. Let dy be a 


Sequénee defined ‘by. by = a,'b,4) = f(b,). Prove that if b = lut) by.exists 


Hh OD 


26. 


Ja 
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and f’ is continuous at b, then |f’(b)| < 1 (provided that we don’t already 
have b; ="b for sonve’n).Srlint? Tf |s4(5) | >Wthens | (40s) > elefor ally an 
an interval around b, and 6, will be in this interval for large enough n. Now 
consider f on the interval [b, b,]. 


This problem investigates for which a > 0 the symbol 
a 


makes sense. In other words, if we define b) = a, by.) = a’, when does 


b= tm J, exist? Note that if b exists, then a? = b by Problem 20. 
ti—> © 


(a) If b exists, then a can be written in the form y!/” for some y. Describe 
the graph of g(y) = y!”” and conclude that 0 <a < e!/*, 

(b) Suppose that | <a < e!/*, Show that {b,} is increasing and also by, < e. 
‘Lhis proves that b exists (and also that b < e). 


The analysis for a < | is more difficult. 


(c) Usmg Problem 25, show that if b exists, then e-! <b <e. Then show 
(hate © Sedans a 


From now on we will suppose that e~® <a < 1. 


(d) Show that the function 


are 


f(x) = 


log x 


is decreasing on the interval (0, 1). 
(e) Let b be the unique number such that a? = b. Show that a < b < 1. 
Using part (e), show that if 0 <x < b, then x <a” <b. Conclude that 


: . li 
f= lim a, @igexistsamal thats —s. 
n> CO 


(f) Using part (e) agam, show that / = b. 
(g) Finally, show that lim a2,42 = b, so that hm b, = b. 
5 ha oS N— Oo 


Let {x,} be a sequence which is bounded, and let 


‘a = sup{Xn, 19 TE em | 


(a) Prove that the sequence {y,} converges. The limit lim y,, 1s denoted by 
= n> OOo i 


lim x, or lim supx,, and called the limit superior, or upper limit, 
fa al®, 2) noo 


of the sequence {x,}. 


(b) Find hm x, for each of the following: 
1] oh @, © 
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28. 


2) 


Gi) x= 1)" [ * -| 
i] 


GC 7. 


(c) Define lim x, (or lim inf x,) and prove that 
n—>0o n= Co 


itn x, = hin %,,2 
n—00 Hi 


(d) Prove that lm x, exists if and only if lim x, = lm x, and that in this 
n—->CO n> co pes 


GAisc mie — Lin Xf — sme. 
CO ii a@, 8; = 
i N®,.9) 


(ec) Recall the definition, in Problem 8-18, of lim A for a bounded set A. 
Prove that if the numbers x, are distinct, then lim x, = Imm A, where 


: ee) 
A= ieee IN}. 


In the Appendix to Chapter 8 we defined uniform continuity of a function 
on an interval. If f(x) is defined only for rational x, this concept still makes 
sense: we say that f is uniformly continuous on an interval if for every € > 0 
there is some 6 > Q such that, if x and y are rational numbers in the interval 


and Ji yjeeo, them | f(x) — f(y) <e. 


(a) Let x be any (rational or irrational) point in the interval, and let {x,} be 


a sequence of rational points in the interval such that lm x, =x. Show 
n> Oo 


that the sequence { f(x,)} converges. 
(b) Prove that the limit of the sequence { f(x,)} doesn’t depend on the choice 
of the sequence {x,}. 


We will denote this limit by f(x), so that f is an extension of f to the 
whole interval. 


(c) Prove that the extended function f 1s uniformly continuous on the imter- 
val. 


Leta > 0, and for rational x let f(«) = a*, as defined in the usual elementary 
algebraic way. ‘Vhis problem shows directly that f can be extended to a 
continuous function f on the whole line. Problem 28 provides the necessary 
machinery. 


(a) For rational...v.<.y,,show thata* < @ for a > J_and a% > a? for.a< 1. 

(b) Using Problem 10, show that for any ¢ > 0 we have |a* — 1| < e for 
rational numbers x close enough to 0. 

(c) Using the equation a* — a? = a*(a*~* — 1), prove that on any closed 
interval f is uniformly continuous, in the sense of Problem 28. 

(d) Show that the extended function f of Problem: 28 is increasing fora > | 
and decreasing for a < | and satisfies f(x + y) = fQ) SO). 


*30. 


31. 


wate Yar 
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The Bolzano-Weierstrass Theorem is usually stated, and also proved, quite 
differently than in the text—the classical statement uses the notion of limit 
points. A point x is a limit point of the set A if for every ¢ > O there is a 
point a in A with |x —a| < e but x 4a. 


(a) Find all limit points of the following sets. 


(i) {oa in NI. 


i 


eae 
(ii) Fe +—:nandm in N}. 


H HT 


Be l 
(111) {—17 [1+ 2] tH in NI. 
n 


(v) Q. 
(b) Prove that x is a limit point of A if and only if for every ¢ > 0 there are 
infinitely many points a of A satisfying |x — a| < €. 
(c) Prove that lim A is the largest limit point of A, and lim A the smallest. 


The usual form of the Bolzano-Weierstrass Theorem states that if A is 
an infinite set of numbers contained in a closed interval [a, b], then some 
point of [a,b] is a limit point of A. Prove this in two ways: 


(d) Using the form already proved in the text. Hint: Since A is infinite, there 
are distinct numbers x1, x2, x3, ... in A. 

(e) Using the Nested Intervals Theorem. Hint: If [a,b] is divided into two 
intervals, at least one must contain infinitely many points of A. 


(a) Use the Bolzano-Weicrstrass ‘Theorem to prove that if f 1s continuous 
on [a, b], then f is bounded above on [a,b]. Hint: If f is not bounded 
above, then there are points x, in [a,b] with f(x,) > u. 

(b) Also use the Bolzano-Weierstrass ‘Vheorem to prove that if f is contin- 
uous on [a,b], then f is uniformly continuous on [a, b] (see Chapter 8, 
Appendix). 


(a) Let {a,} be the sequence 


1 


eo 34 (| 
9 33 


Se ll 2 2 
mr Se Sb Ss 5+ GEG? 


oe) 


zl 
> 4 


Wi bo 


Suppose that 0 < a < b< 1. Let N(m: a,b) be the number of integers 
é os 1 ? \ 
j <asuch that a; ism (a,b). (Thus N(2; 7 aL 2. andy (4 1 2) = 


Prove that 
N(n: a,b) 
lim. —— = b—-a. 
now n 


(b) A sequence {a,} of numbers in [0,1] is called uniformly distributed 
in [O, 1] if 
Niwa, b) 


liny ————. = b =a 
n> oo Wi 
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bast Ko 


(a) 


for all a and b with 0 < a < b < 1. Prove that if s is a step function 
defined on [0,1], and {a,,} is uniformly distributed in [0, 1], then 
.. S(ay) > +S (ap) 
s = lm ————__.. 
0) N—> OO Vl 
Prove that if {a,} is uniformly distributed in [0,1] and f is integrable 
on [0, 1], then 


tHe teats) 
f= im —————_. 
0 


noo n 


Let f be a function defined on [0,1] such that lim f(y) exists for all a 


—a 
in [O, Ll]. For any ¢ > O prove that there are only finitely many points a 
in [O, 1] with | lim f(y) — f(a)| > &. Hint: Show that the set of such 
yoad 
points cannot have a limit point x, by showing that lim f(y) could not 
you 

XIst. 

Prove that, in the terminology of Problem 21-5, the set of points where 
f is discontinuous is countable. This finally answers the question of 
Problem 6-17: If f has only removable discontinuities, then f is contin- 
uous except at a countable set of points, and in particular, f cannot be 
discontinuous everywhere. 


CHARTER 


INFINITE SERIES 


Infinite sequences were introduced in the previous chapter with the specific inten- 
tion of considering their “sums” 


(5) Pp GD =p GR) ap eer 


in this chapter. This is not an entirely straightforward matter, for the sum of 
infinitely many numbers is as yet completely undefined. What can be defined are 
the “partial sums” 


Sn = A ++? + an, 


and the infinite sum must presumably be defined in terms of these partial sums. 
Fortunately, the mechanism for formulating this definition has already been devel- 
oped in the previous chapter. If there is to be any hope of computing the infinite 
sum aj +a2 +43 +--+, the partial sums s, should represent closer and closer ap- 
proximations as #7 is chosen larger and larger. ‘This last assertion amounts to little 
more than a sloppy definition of limits: the “infinite sum” ay +a2+a3+--- ought 


to be lim s,. ‘his approach will necessarily leave the “sum” of many sequences 
noo 


undefined, since the sequence {s,} may easily fail to have a limit. For example, the 
sequence 


1, =-1, lL, =l, 


with a, = (—1)"*! yiclds the new sequence 
Sia — le 
5) = a7 a= 0. 
53 =a; +ar + az=— |, 
sg=arta+aataya=0O0. 


for which lim s, does not exist. Although there happen to be some clever ex- 
ie 


tensions of the definition suggested here (see Problems 12 and 24-20) it seems 
unavoidable that some sequences will have no sum. For this reason, an acceptable 
definition of the sum of a sequence should contam, as an essential component, 
terminology which distinguishes sequences for which sums can be defined from 
less fortunate sequences. 
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DEFINITION 


The sequence {a,} 1s summiable if the sequence {s,} converges, where 


So =@] +2: ae 


In this case, lim 5s, 1s denoted by 
> Oo) 
Oo 


) Gi (or, less formally, aj + a2 +a3+-:- 


n=] 


and 1s called the sum of the sequence {a,}. 


The terminology mtroduced im this definition is usually replaced by less precise 


expressions; indeed the title of this chapter 1s derived from such everyday language. 
lo.) 

An infinite sum ) da, 18 usually called an ifinite series, the word “series” emphasiz- 
n=] 


mg the connection with the mfinite sequence {a,}. ‘Vhe statement that {a,} 1s, or 
oO 


is not, summable is conventionally replaced by the statement that the series ) Ay 
n=l 


does, or does not, converge. ‘This terminology 1s somewhat peculiar, because at 
oO 


best the symbol ) dy, denotes a number (so it can’t “converge”), and it doesn’t de- 


n=] 
note anything at all unless {a,} 1s summable. Nevertheless, this informal language 


is convenient, standard, and unlikely to yield to attacks on logical grounds. 


Certain elementary arithmetical operations on infinite series are direct conse- 
quences of the definition. It is a simple exercise to show that if {a,} and {bn} are 
summable, then 


one) 0° ore 
DEG, +i, > an + Ds ba, 
n=] n=} al 

fone ox 

Spotase ee 

n=l n=l 


As yet these equations are not very interesting, since we have no examples of 
summable sequences (except for the trivial examples m which the terms are even- 
tually all 0). Before we actually exhibit a summable sequence, some general con- 
ditions for summability will be recorded. 


‘There is one necessary and sufficient condition for summability which can be 
stated immediately. ‘The sequence {a,} is summiable if and only if the sequence {5,} 
converges, which happens, according to ‘Theorem 22-3, ifand only if lim_ 8, — 

Mm NOOO 
Sy» = 0; this condition can be rephrased in terms of the original sequence as follows. 


THE CAUCHY CRITERION 


THE VANISHING CONDITION 


23. Infinite Series 473 


‘The sequence {a,} is summable if and only if 


lim aAn+] + mea + Ain = 0. 
[e,@) 


Thi 


Although the Cauchy criterion is of theoretical importance, it is not very useful 
for deciding the summability of any particular sequence. However, one simple 
consequence of the Cauchy criterion provides a necessary condition for summability 
which is too important not to be mentioned explicitly. 


If {a,} is summable, then 


hin — OF 
n— Co 


This condition follows from the Cauchy criterion by taking m =n + 1; it can also 
be proved directly as follows. If lim s, =/, then 


na CO 
teem — Ton (orn ee eben Oe 
n-—-> OO n— Oo n— O&O Leo 


= es) 


Unfortunately, this condition is far from sufficient. For example, lim 1|/n = 0, 
n—- Oo 


but the sequence {1/n} is not summable; in fact, the following grouping of the 
numbers |/n shows that the sequence {s,} is not bounded: 


l l l l ] l l ! l 
rae pine Me Ria aan Geen ee ae ape eo 
a ee ——— 


1 L gill 

=) = = 2 
(2 terms, (4 terms, (8 terms, 
each >t) each > 3) each >t) 


The method of proof used in this example, a clever trick which one might never 
see, reveals the need for some more standard methods for attacking these problems. 


These methods shall be developed soon (one of them will give an alternate proof 
CO 


that De 1/n does not converge) but it will be necessary to first procure a few 
n=l 
examples of convergent series. 
The most important of all infinite series are the “geometric series” 
foe) 
api = eee 
n=0 
Only the cases |r| < 1 are interesting, since the individual terms do not approach 0 
if |r| > 1. These series can be managed because the partial sums 


So leper ese 
can be evaluated in simple terms. The two equations 


= | : 2 wn 
Se lies reir Zach i l 
5) 
Se per? 4 Dew + pp 
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THE BOUNDEDNESS CRITERION 


THEOREM 1 
(THE COMPARISON TEST) 


lead to 
sl = l= 
or 
i= prt 
‘Cr a =o 
(division by | — r is valid since we are not considering the case r = 1). Now 
lim r” = 0, since |r| < 1. It follows that 
i bral @, 2) 
O° n+l 
, eee l 
Sy = hm ——— = —, liecele 
now |—r l-r 
n=0 
In particular, 
oO ] n oO ] n 1] 
“eye 0) =e = 
n=] n=0 2 
that 1s, 
rags le ie 8) 
2 4 8 16 Tn 


an infinite sum which can always be remembered from the picture m Figure 1. 


FIGURE 1 


Special as they are, geometric series are standard examples from which nportant 
tests for summability will be derived. 

For a while we shall consider only sequences {a,} with each a, > 0; such 
sequences are called nonnegative. If {a,} is a nonnegative sequence, then the se- 
quence {s,} 1s clearly nondecreasing. This remark, combined with ‘Theorem 22-2, 


provides a simple-minded test for summability: 


A nonnegative sequence {a,} is summable if and only if the set of partial 
sums Sy is bounded. 


By itself, this criterion is not very helpful deciding whether or not the set of 
all s, is bounded is just what we are unable to do. On the other hand, if some 
convergent series are already available for comparison, this criterion can be used 
to obtam a result whose simplicity belies its importance (it is the basis for almost 
all other tests). 


Suppose that 


= @y=<b) for all n. 


PROOF 


THEOREM 2 
(THE LIMIT COMPARISON TEST) 
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oo oO 
Then if ) b, converges, so does ) An. 


n=1 n=1 


If 
Sy = QA, +--+: +n, 
tn = by +---+bdp, 
then 
0 =s, =f, for all n. 
6o 


Now {t,} is bounded, since ) b, converges. Therefore {s,} is bounded; conse- 


n=l 
ox 


quently, by the boundedness criterion ) a, converges. § 


n=1 


Quite frequently the comparison test can be used to analyze very complicated 
looking series in which most of the complication is irrelevant. For example, 


3 2 + sin?(n + 1) 
2" 4 n2 


converges because 


n 2 = n? 
2” + n- 2 


=. J 
=3 


and 


tm iE: 
NM] vw 
oe) 


n=1 
is a convergent (geometric) series. 
Similarly, we would expect the series 
© 
Eee eT —1+4+sin?n3 


to converge, since the nth term of the series is practically 1/2” for large n, and we 
would expect the series 


. . 9 . . Tr . 
to diverge, since (n + 1)/(n~ + 1) is practically 1/n for large n. ‘These facts can 
be derived immediately from the following theorem, another kind of “comparison 
test.” 


oO oO 
If an, b, > O and lim a,/b, = c # 0, then ) a, converges if and only if ) b,, 
n—> OO 


converges. n=l n=1 
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PROOF 


THEOREM 3 (THE RATIO TEST) 


PROOF 


[e.@) 
Suppose ) b, converges. Since lim a,/b, = c, there is some N such that 
= AS.9) 


eal 
n= oer. forma 
(o@) 
But the sequence eh, certainly converges. ‘Then Theorem | shows that 


n=N 
(ove) ee) 


) a, converges, and this implies convergence of the whole series ) a,, which 


n=N n= 
has only finitely many additional terms. 


The converse follows immediately, smce we also have lim b,/a, = 1/c 4 0. J 
n— 0O 


The comparison test yields other important tests when we use previously an- 
lo. @) 
alyzed series as catalysts. Choosing the geometric series ) r”, the convergent 
n=0 
series par excellence, we obtain the most important of all tests for summability. 


Let a, > 0 for all 1, and suppose that 


~  An+l 
lim =F 
NCO Ay 


[o @) 
‘Then > an converges if r < 1. On the other hand, if r > 1, then the terms a, 


n=1 
foe) 


are unbounded, so ) a, diverges. (Notice that it is therefore essential to compute 


n=l 
lun, a,47/a, and nottlim a,,/a,%!) 
n— oo n—oCo 


Suppose first that r < 1. Choose any number s with r < s < 1. The hypothesis 


An+] 


lim = pre ih 
N>0CO Ay, 
implies that there is some N such that 
a 
oa = forme Ne 
This can be written 
a, te = SQy Pee 
| sie 5 forn >N 


Thus 
an4+1 S San, 


. 
An4+2 S Say41 SS° an, 


AN+k S by 
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je, ¢) o 
Since So ays* = dan Nas converges, the comparison test shows that 
k=0 k=0 
yy an = Yo ans 
n=N k=0 


‘2 2) 
converges. This implies the convergence of the whole series ) Gn: 
n=] 
The case r > | is even easier. If 1 < s <r, then there is a number N such that 


An+] 
>s5 forn > N, 
Qn 
which means that 
AN+k = ays* b= 0: to57 


so that the terms are unbounded. J 


es 
As a simple application of the ratio test, consider the series ) I/n!. Letting 
n=1 
a, = 1/n! we obtain 


| 
Qn41 (n+l)! mi | 
Ay | (n+1)! n+1- 
n! 


Thus 
: An+] 
lim ——1 U8 


m=O Gy, 


oO 
which shows that the series ) | /n! converges. If we consider instead the series 


n=] 
O° 


) r"/n!, where r is some fixed positive number, then 


n=l 
prt ] 
} (n+ 1)! ; r 
lim. ———— = lim = 0); 
noo r n>oon-+ | 
n! 
so r”/n! converges. It follows that 
n=1 
n 
ieee 
lim — =0, 


noo n! 


478 Infinite Sequences and Infinite Series 


THEOREM 4 (THE INTEGRAL TEST) 


PROOF 


a result already proved in Chapter 16 (the proof given there was based on the same 
[e.2) 


ideas as those used in the ratio test). linally, if we consider the series ) nr” we 
n=l 
have 
(n + 1)rt! 


lin. —————— = lim r - 
N00 nr” n—>0o n 


since lim (n + 1)/n = 1. 


This provestthat nf Ose < |athen ) nr” converges, 
n—> CO 5 


n=l 


and consequently 


lim ar” = 0. 
1 teal ®, @) 


(This result clearly holds for —1 <r < 0, also.) It 1s a useful exercise to provide a 

direct proof of this limit, without using the ratio test as an intermediary. 
Although the ratio test will be of the utmost theoretical importance, as a practical 

tool it will frequently be found disappomting. One drawback of the ratio test is 


the fact that lim a,4)/a, may be quite difficult to determine, and may not even 
n> CO 


exist. A more serious deficiency, which appears with maddening regularity, is the 


fact that the limit might equal 1. ‘Phe case lim a,41/a, = | ts precisely the one 


noo 
{a,} might not be summable (for example, if a, = 1/n), 


which is inconclusive: 


. . . 7 . 2 
but then again it might be. In fact, our very next test will show that yo /ny 


nM 
converges, even though 


This test provides a quite different method for determining convergence or diver- 
gence of infinite series—like the ratio test, it is an immeciiate consequence of the 
comparison test, but the series chosen for comparison is quite novel. 


Suppose that f is positive and decreasing on [1], 00), and that f(#2) = a, for all n. 
OO 


Then ) ad, converges if and only if the limit 


i 
OO A 
f= "iin yp 
1 A—>oo 1 
CXIStS. 
‘The existence of Inn e f is equivalent to convergence of the series 
A> ww 


Pramccafecpest oe 


FOG WERSE, 2) 
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Now, since f 1s decreasing we have (Figure 2) 


n+l 


f(n+1)< Ss) 


The first half of this double inequality shows that the series a Qn41) May be com- 


: n=] , 
O° n+l ee. 


OO 
pared to the series ) f, proving that ) ay4) (and hence ) dy) converges 


A n=l *" n=] Hl 
if lim exists. 
A> oo ] 
Ov n+l 


The second half of the inequality shows that the series = f may be com- 


n 
n=] 
Ox 


A oo 
pared to the series ) dn, proving that jm / f must exist if ) a, converges. ff 
eee 
n=1 n=1 


Only one example using the mtegral test will be given here, but it settles the 


question of convergence for infinitely many series at once. If p > O, the conver- 
OO 


gence of ye 1/n? is equivalent, by the integral test, to the existence of 


iI 
Se 
— dx 
ip ee 
Now fs 1 1 . , me 
/ ap ax = (p—1) Ap-| p- iy P 
ie log A, p=, 


A> ov 


A OO 
‘This shows that lim i ] (x e@ieexisisnt = euemot 7 le ints 2 lly aails 
| 


n=l 
9) 


converges precisely for p > 1. In particular, ) 1/n diverges. 
n=] 
The tests considered so far apply only to nonnegative sequences, but nonpositive 
sequences may be handled in precisely the same way. In fact, since 


See 


n=1 n=] 
all considerations about nonpositive sequences can be reduced to questions involv- 
ing nonnegative sequences. Sequences which contain both positive and negative 
terms are quite another story. 
CO 
If yak is a sequence with both positive and negative terms, one cau con- 


fil 
ee 


sider instead the sequeuce ) la, |, all of whose terms are nonnegative. Cheerfully 


n=l 
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DEFINITION 


THEOREM 5 


PROOF 


ignoring the possibility that we may have thrown away all the interesting informa- 
tion about the original sequence, we proceed to eulogize those sequences which 
are converted by this procedure into convergent sequences. 


[e,@) le, 6) 


‘The series ) a, 1s absolutely convergent if the scries ) |a,| 1s convergent. 


n= pS) 
(In more formal language, the sequence {a,} is absolutely summable if the 


sequence {|a,|} 1s summable.) 


Although we have no right to expect this definition to be of any interest, it turns 
out to be exceedingly important. ‘The following theorem shows that the definition 
is at least not entirely useless. 


Every absolutely convergent series is convergent. Moreover, a series is absolutely 
convergent if and only if the series formed from its positive terms and the series 
formed from its negative terms both converge. 


OO 
if ) la,| converges, then, by the Cauchy criterion, 


i 


lim slay ee == lal 0. 


MN OO 
Since 
lQn+1 Sp eo 8 Sp an| < lan+il sea oS ae (25h 


it follows that 
lim dp4p F+++ +m = 0, 


Mm ,N— CO 
io @) 
which shows that ) Ay converges. 


n=! 
To prove the second part of the theorem, let 


4 Ain lag 8 
ay = 


Ome Sala: 
| an, Thong < 0 


in = 


Q, if ay = 0. 
30 the * is the series fi ‘d fr he sitive terms of and = 
so that a, 18 the series formed trom the positive terms o Ayn, anc Bhi; 


n=l n=l n=] 
is the series formed from the negative terms. 


Oo OO 

If > 3: * and ies both converge, then 
n=] n=] 

lo 6) oe OO Q 

iS a lta —(a, )) = Soa,” = Sua 

i= n 


n=] n=] n=l 
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oO 
also converges, so ) a, converges absolutely. 
n=1 
On the other hand, if ) la,| converges, then, as we have just shown, ) An 


n=! n=! 
also converges. Therefore 


y 
) an = 5 ) An + ) lan| 
n=| any n=1| 
and 
x | 0° x 
y a = 5( ) apn — ) nl) 
n=1 n=1 n= 


both converge. § 


It follows from Theorem 5 that every convergent series with positive terms can 
be used to obtain infinitely many other convergent series, simply by putting in 
minus signs at random. Not every convergent series can be obtained in this way, 
however—there are series which are convergent but not absolutely convergent 
(such series are called conditionally convergent). In order to prove this state- 
ment we need a test for convergence which applies specifically to series with positive 
and negative terms. 


THEOREM 6 (LEIBNIZ’S THEOREM) ~— Suppose that 


aj > an >az3>--->0, 


and that 


lim a, = 0. 
na co 


Then the series 


oO 
) (-1)"*!a, = aj — a2 +a3—a4+as5—-:-- 
n=] 
converges. 
PROOF Figure 3 illustrates relationships between the partial sums which we will establish: 

(1) sg < 54 <S6 S- 

(2) sj] > 53 > s5 >-*:, 

(3) se < 8; if k is even and / is odd. 

tach): BAG a ae 
S$? S4 S6 Sg Spo Sy2 S}y So S7 Ss $3 S$] 


FIGURE 3 
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‘To prove the first two inequalities, observe that 


Ch)! S3n42 = Son F Gantt —Gone2 

= S91, since G2n+1 2 A2n+2 
(2) S2n+3 = S2n+1 — 42n42 + 4243 

= Spas SINCE G2n42 > A2n43- 


‘To prove the third inequality, notice first that 


S2n = S2n—1 — 42n 


Sy 55-1 since a2, > 0. 


This proves only a special case of (3), but in conjunction with (1) and (2) the general 
case is easy: if k is even and / is odd, choose n such that 
2n>k and 2n—121; 
then 
Sk = S2n = S$2n-1 SS, 

which proves (3). 

Now, the sequence {s2,,} converges, because it is nondecreasing and is bounded 
above (by s; for any odd /). Let 


@ = sup{s2,} = lim s2,. 
. ; n-> OO 
Similarly, let 
B = inf {s2,41} = lim S2n+1- 
noo 


It follows from (3) that a < B; since 


S2n41 — S2n = G24) and im a= 0 


it is actually the case that a = B. This proves that aw = 6 = lim s,. J 
n> 


The standard example derived from Theorem 6 is the series 
| I | | 
Laas Sate 
x 
which is convergent, but not absolutely convergent (since ) 1/n does not con- 


n=1 
verge). If the sum of this series is denoted by x, the following manipulations lead 


to quite a paradoxical result: 
x=1-—}+}4 
7). es we oe ee 
=I—3- gp aeig 2 tS i ee a 


(the pattern here is one positive term followed by two negative ones) 


=(1-4)-44+6¢-)-§+¢-)-¢4+¢6-ap-bt+°° 

= 1) | l | | l | | 

= 5g ge ea ae ad Te, 
d-$44-}ad-d4)-f49 


THEOREM 7 
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so x = x/2, implying that x = 0. On the other hand, it is easy to see that x 4 0: 
the partial sum s2 equals 4, and the proof of Leibniz’s Theorem shows that x > 52. 
This contradiction depends on a step which takes for granted that operations 


valid for finite sums necessarily have analogues for infinite sums. It is true that the 
sequence 


(bn}=1, —3. - 


on, ee 1 ee ee oe te ik i alee ol ell 

ne pe wes gt Gy (jo 7 Semon Oe ae Pee 

In fact, {b,} is a rearrangement of {a,} i the following precise sense: each 
by = afin) Where f is a certain function which “permutes” the natural numbers, 
that is, every natural number m is f(n) for precisely one n. In our example 


fQm+1)=3m+) (the terms |, i. ee go into the Ist, 4th, 7th, 
places), 

f (4m) = 3m (the terms —i, —%. ype. go mio the 3rd, 6th, 9th, 
sa. Dideesis 

f (4m + 2) =3m+2 = (the terms —5, —i, —+. ... go into the 2nd, 5th, 8th, 
22 Dlages? 


ie.2) ie, 2) 


Nevertheless, there 1s no reason to assume that ) b, should equal ) ay: these 


n=] n=| 
sums are, by definition, lim bj +---+b, and lm a, +---+a4,, so the particular 
N— CO n> CO 
CO 
order of the terms can quite conceivably matter. The series ) (=1)""" /itisaot 
il 


special in this regard; indeed, its behavior is typical of series which are not ab- 
solutely convergent—the following result (really more of a grand counterexample 
than a theorem) shows how bad conditionally convergent series are. 


co 


If ) dn converges, but does not converge absolutely, then for any number @ there 


n=l 
OO 
is a rearrangement {b,} of {a,} such that ) be a. 
(pel 
le. @) re, @ 


Let S Pn denote the series formed from the positive terms of {a,} and let ae 
=! Il 
denote the series of negative terms. It follows from Theorem 5 that at least one of 
these series does not converge. As a matter of fact, both must fail to converge, for 
if one had bounded partial sums, and the other had unbounded partial sums, then 
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~~ 
the original series ) a, would also have unbounded partial sums, contradicting 


n=1 
the assumption that it converges. 


Now let @ be any number. Assume, for simplicity, that a > 0 (the proof for 
oo 


a < 0 will be a simple modification). Since the series ) Pn IS not convergent, 


n=1 
there is a number N such that 


N 
) Dn >a: 


n=] 
We will choose N; to be the smallest N with this property. This means that 


N,-1 


(1) yy Pn <Q, 


n=1 


Ny 
but (2) DP > @. 


n=] 


Then if 


we have 


PN, 
—_ —— 
soph ee 
0 DA chee Pl a Pi +---+ Pn,-1 + Pn, 


FIGURE 4 
This relation, which is clear from Figure 4, follows immediately from equation (1): 
N,-1 
S$} —@ < S$; — ) Pn = PN, 
n=l 
‘To the sum S; we now add on just enough negative terms to obtain a new sum 7] 
which is less than aw. In other words, we choose the smallest integer M, for which 


M, 
T; = S; + PRs <a. 


n=1 
As before, we have 
a — T| < —qm;: 


We now continue this procedure indefinitely, obtaining sums alternately larger 
and smaller than @, each time choosing the smallest Ny or My possible. ‘The 


THEOREM 8 


PROOF 
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sequence 


Pisses PNy> 150+ +>9M,s PNi4+1+-+++ PNos + 


is a rearrangement of {a,}. The partial sums of this rearrangement increase to Sj, 
then decrease to 7), then increase to Sz, then decrease to 72, etc. ‘To complete the 
proof we simply note that |$;—a@| and |7;,—@ 


are less than or equal to py, or —qm,, 


respectively, and that these terms, being members of the original sequence {ap}, 


must decrease to O, since ) a, converges. 


n=l 


Together with Theorem 7, the next theorem establishes conclusively the distinc- 
tion between conditionally convergent and absolutely convergent series. 


If a, converges absolutely, and {b,} is any rearrangement of {a,}, then ) Dy 


n=] n=] 
also converges (absolutely), and 


fone) one) 
T= Wa. 


n=l n=1 


Let us denote the partial sums of {a,} by s,, and the partial sums of {b,} by th. 
[o,) 


Suppose that ¢ > 0. Since ) a, converges, there is some N such that 


n=] 


[o,@) 


) an — SN 


n=! 


<K Eo 


le @) 
Moreover, since ) |a, | converges, we can also choose N so that 


n=1 


[e,@) 
Yo lanl — (ail +++ + lawl) <e, 


n=l 
Le. S@ that 


lanai] + lansa2l + lana3l+--: <. 


Now choose M so large that each of aj,...,ay appear among ),,....by. Then 
whenever m > M, the difference t, — sy is the sum of certain a;, where ay,....aNn 
are definitely excluded. Consequently, 


tm — Syl < lan+1| + lan42l + lana3| +--- - 
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Thus, if m > M, then 


oO 
) an — tn 


oO 
= Da — Sn — (ln — Sn) 


n=1 n=1 
(ove) 
s ae? — Sn | + |tm — Sul 
n=1 
ace Ee 
0° fo 


Since this is true for every ¢ > O, the series ) b, converges to ) as: 


n=] n=1 
fore 


‘To show that Dib converges absolutely, note that {|b,|} is a rearrangement 


n=1 


oO (e, @) 
of { |a,| }; since ) \a,| converges absolutely, ) |b, | converges by the first part of 
n=1 n=1 


the theorem. J 


Absolute convergence is also important when we want to multiply two infinite 
series. Unlike the situation for addition, where we have the simple formula 


oO 


oe fons 
a + pan? = Sian + b,), 


n=| n=l n=1 


there isn’t quite so obvious a candidate for the product 


(Yo«) (Soe) = (aj +a2+---)- (1 +b2+---). 


n=1 n=1 


It would seem that we ought to sum all the products a;b;. The trouble is that these 
form a two-dimensional array, rather than a sequence: 


a,b a,b> a\b, 
arb arb arb, 
ab a3b> a3zb, 


Nevertheless, all the elements of this array can be arranged in a sequence. ‘The 
picture below shows one way of doing this, and of course, there are (infinitely) 
many other ways. 


ayby ayb2 a, b3 
arb, arb» arb; 


a3b, axb a3b3 
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Suppose that {c,} 1s some sequence of this sort, containing each product a;b; 
just once. ‘Then we might amg: expect to have 


y Cn = =55 An * y oe 
n=] n=1 


n=] 
But this zsn’t true (see Problem 10), nor is this really so surprising, since we've said 
nothing about the specific arrangement of the terms. The next theorem shows 
that the result does hold when the arrangement of terms is irrelevant. 


foe) oO 
Ne ) ad, aud ) b, converge absolutely, and {c,} is any sequence containing the 


n=] al 
products a;b; for each pair (i, 7), then 


va = > ie, . Sox 


n=] n=] n=] 


Notice first that the sequence 


E EB 
pr=>_lail- > 1a 
=) i 


converges, since {a,} and {b,} are absolutely convergent, and since the limit of a 
product is the product of the limits. So {pz} is a Cauchy sequence, which means 
that for any ¢ > 0, if L and L’ are large hae then 


San PD De 


i=] iI 


(1) > opel EA & 


i ole jit 


It follows that 


No | 


Now suppose that N is any number so large that the terms c, for n < N include 
every term a,b; for i, j < L. Then the difference 


N 
D2 Cr 5 dj: y bj 


n=] i=] 


consists of terms a,b; with i > L or j > L, so 


N 
» Gi 5 qj: pe b; 


il i=] 


(2) 


=e eT 


Ol.) ae 


<€ by (1). 


But since the limit of a product is the a. of the limits, we also have 


» |y rad hy, >» |- 


i=! 
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for large enough L. Consequently, if we choose L, and then N, large enough, we 
will have 


00 (ore) N fo.) 00 L JE 
Sh — Salis Da Sy be ed 
i=l j=l i= i=l i= i=l =| 
L L N 
+} doa Db) — Doe 
ed | j=) n=] 
as by (2) and (3), 


which proves the theorem. §f 


Unlike our previous theorems, which were merely concerned with summability, 
this result says something about the actual sums. Generally speaking, there is 
no reason to presume that a given infinite sum can be “evaluated” in any simpler 
terms. However, many simple expressions can be equated to infinite sums by using 
Taylor’s Theorem. Chapter 20 provides many examples of functions for which 


n (i) ‘ 
f(x) = »- LO 6, <a a)’ ee Rna(X), 


i! 
i=0 


where lim Rp «(x) = 0. This is precisely equivalent to 
n—->&X 


“ f@), 
i! 


f@)= tim oS (xa), 
i=0 
which means, in turn, that 

vee 

f(a) , 

Kix) = ‘Dearie (x —a)’. 

i=0 
As particular examples we have 
ee ae > x gi 
Sli =x [3° 5 oa 3 
x7 xt “x8 
cosz = =o a @r ; 
ro] x x2 x? x4 
ea l+ 7 = ay + 3 + qn + . 
eye? es ree 
arctanx =x —-—+—>-——>+:::, x| <1, 
3 5 7 
log(1 +x) a 5 op l<x<! 
og x)=x-—+—-—4+—+4+:::, -I<x<l. 
ale ale ee 
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(Notice that the series for arctan x and log(1 +.) do not even converge for |x| > 1: 
in addition, when x = —1, the series for log(] + x) becomes 


which does not converge.) 
Some pretty impressive results are obtained with particular values of x: 


1? 7! 
a 3 Sn 
l l l 
joe tt ot 
1 i ey 
T - 
| ee ee | 
loge Zz = =5 at 


More significant developments may be anticipated if we compare the series for 
sin.x and cos x a little more carefully. ‘The series for cos x is ust the one we would 
have obtained if we had enthusiastically differentiated both sides of the equation 


ee 

we 
term-by-term, ignoring the fact that we have never proved anything about the 
derivatives of infinite sums. Likewise, if we differentiate both sides of the for- 
mula for cos x formally (i.e., without justification) we obtain the formula cos’ (x) = 
—sinx, and if we differentiate the formula for e* we obtain exp’(x) = exp(4). 
In the next chapter we shall see that such term-by-term differentiation of infinite 
sums is indeed valid in certain important cases. 


PROBLEMS 


1. Decide whether each of the following infinite series is convergent or diver- 
gent. The tools which you will need are Leibniz’s ‘Theorem and the compar- 
ison, ratio, and integral tests. A few examples have been picked with malice 
aforethought; two series which look quite similar may require different tests 
(and then again, they may not). ‘The hint below indicates which tests may be 


used. 
: sin ng 
=~ 
n=1 i 
fi) 1-f4g-h4 
ss 17 2. 2... (ao 
i ae es 
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(vi) 


(vii) 


(viii) 


(ix) 


(x) 


(xi) 


(xi) 


(xiii) 


(xiv) 


=. 4 
2 logn- 


= (logn ye 


nlogn- 


l 


a n! 
{XV11) ) —, 
n 


(xix) 


n=1 


oO 
2"n! 


n=1 


n(logn)* 


ae ‘s n2(log n) 


______. (The summation begins with n = 2 simply to avoid 
the meaningless term obtained for n = 1). 


(xx) 
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[o,6) 
ry! 


ne 
n= | 


Hint: Use the comparison test for (1), (v), (vi), (ix), (x), (xi), (xin), (xiv), (xvii); 
the ratio test for (vn), (xviii), (xix), (xx); the integral test for (vim), (xv), (xvi). 


The next two problems examine, with hints, some infinite series that require 


more delicate analysis than those in Problem 1. 


*2. (a) If you have successfully solved examples (xix) and (xx) from Problem 1, 


a. 


(b) 


OO 
it should be clear that ) a"n!/n" converges for a < e and diverges for 


n=1 
CO 


a > e. For a = e the ratio test fails; show that ) e"n!/n” actually 
p= 
diverges, by using Problem 22-13. 
OO 
Decide when ) n"/a"n! converges, again resorting to Problem 22-13 
nl 
when the ratio test fails. 


OO 


Problem | presented the two series Y “(log n)~* and Y “(log nm) "ol whieh 


n=2 n=2 


the first diverges while the second converges. The series 


= | 


DE (log nylogn ] 


2 


which lies between these two, is analyzed in parts (a) and (b). 


(a) 


dO 
Show that f;~ e”/y” dy exists, by considering the series ) (e/n)". 


nl 


(b) Show that 


= | 


DS (log n)logn 


ne 
converges, by using the integral test. Hint: Use an appropriate substitu- 
tion and part (a). 


Show that 


ze | 


» (log n ylog (log n) 


n=! 


diverges, by using the integral test. Hint: Use the same substitution as 
in part (b), and show directly that the resulting integral diverges. 
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4. 


5: 


6. 


; = ] 
Decide whether or not = ——— converges. 
nl l+1/n 
n=1 
OO (o) 
(a) Prove that if a bn converges absolutely, then so does > om” : 
nil p= 


*(b) Show that this does not hold for conditional convergence. 


Let f be a continuous function on an interval around 0, and let a, = f(1/n) 
(for large enough 7). 


OO 
(a) Prove that if ye converges, then f(0) = 0. 


pS ee) 
(b) Prove that if f’(O) exists and Dar converges, then f’(0) = 0. 
n=] 0° 
(c) Prove that if f”’(O) exists and f(0) = f’(O) =0, then De an converges. 
oO n=| 


(d) Suppose ae converges. Must f’(O) exist? 
n= oO 
(ec) Suppose f(0O) = f’(0) = 0. Must Soa, converge? 
n=] 
(8) 
(a) Let {a,} be a sequence of integers with 0 < a, < 9. Prove that a On 


idl 
exists (and lies between 0 and 1). (This, of course, is the number which 


we usually denote by 0.a;a2a3a4....) 
(b) Suppose that 0 < x < 1. Prove that there is a sequence of integers {ay} 


= 
with 0 < a, < 9 and De anne = x. Mint: ghar examplesai— 07 


n=l 
(where [y] denotes the greatest integer which is < y). 


(c) Showlthaimaie {a,} is repeating, t.e.,. is of the orniegi, ax... ar. 
OO 


Gt, @ aa), Ad, «.., then ) d,10-" is a rational number (and find 
il 


it). he same result naturally holds if {a,} is eventually repeating, ie., if 


the sequence {ay4x} is repeating for some N. 
oO 


(d) Prove thatif x = ea, 10~" is rational, then {a,} is eventually repeat- 


n=] 
ing. (Just look at the process of finding the decimal expansion of p/q 


dividing g into p by long division.) 
Suppose that {a@,} satisfies the hypothesis of Leibniz’s Vheorem. Use the 


proof of Leibniz’s ‘Theorem to obtain the following estimate: 


ie. 3) 
YH)" ay = la) — ao + ++: Pay ana 


n=l 
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[o,@) 


9. (ay, Prove thai 7 .2,0.and sini </a, = 7 then ) a, converges if r < I, 


n> 0o 
pl 


and diverges if r > 1. (The proof is very similar to that of the ratio test.) 
OO 


This result is known as the “root test.” More generally, ) dy converges 
n= 


if there is some s < | such that all but finitely many ¢/a, are < s, and 
lo @) 


) a, diverges ifinfinitely many ~%/a, are > 1. ‘Vhis result is known as the 
n=l 


“delicate root test” (there is a similar delicate ratio test). It follows, using 


the notation of Problem 22-27, that ) a, converges if lim */a, < | 


ao 
n=1 
and diverges if hm Van > 1; no conclusion is possible if im ay ale 


(b) Prove that if the ratio test works, the root test will also. Hint: Use a 
problem from the previous chapter. 
It is easy to construct series for which the ratio test fails, while the root 
test works. For example, the root test shows that the series 


I I 2 N22 il 1\3 
mer Me) aly) aril ae 
converges, even though the ratios of successive terms do not approach a 


limit. Most examples are of this rather artificial nature, but the root test 
is nevertheless quite an important theoretical tool. 


n 


10. For two sequences {dy} and {b,}, let c, = ) G;Pyjoee ( Lugitve, isthe sui 


11. 


12. 


k=] 
of the terms on the nth diagonal in the Eee ulUee purges 486.) ‘The series 
oO 


Cy, is called the Cauchy product of a and iy ie — ee — 
n=l n=l n=] 

(—1)"//n, show that |c,| > 1, so that the Cauchy product does not con- 
verge. 


(a) Consider the collection A of natural numbers that do vot contain a 9 in 
their usual (base 10) representation. Show that the sum of the reciprocals 
of the numbers in A converges. Hint: How many numbers between | and 
9 are in A?; how many between 10 and 99?; etc. 

(b) If B is the collection of natural numbers that do not have all 10 digits 
OF... 9 in their usual representation, then the sum of the reciprocals of 
the numbers in B converges. (So “most” integers must have all ten digits 
in their representation.) 

A sequence {a,} is called Gesaro summable, with Cesaro sum /, if 

mci la “isin 
hn — | 


n> 0° n 


(where sx = ay +--+ + a,x). Problem 22-16 shows that a summable sequence 
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13. 


14. 


15. 


16. 


sal: 


*18. 


is automatically Cesaro summable, with sum equal to its Cesaro sum. Find 
a sequence which is not summable, but which is Cesaro summable. 


Suppose that a, > O and {a,} 1s Gesaro summable. Suppose also that the 
fo@) 


sequence {nay} is bounded. Prove that the series a, converges. Hint: If 
dN 
n n 
n : 
= yo ai anc G,.— — > prove that s,, — ——o, 1s bounded. 
= (a n+] 
i= = 


This problem outlines an alternative proof of Theorem 8 which does not rely 
on the Cauchy criterion. 


(a) Suppose that a, > 0 for each n. Let {b,} be a rearrangement of {a,}, 
and let s, =a; +-::+a, and t, = bj) +---+b,. Show that for each n 
there is some m with s, < ty»- 


(b) Show that }° a, < Do by if }° by exists. 


n=1 n=] n=] 
oo) Oo 

(c) Show that » az ‘3 De 
n=1 n=] 


Oo 
(d) Now replace the condition a, > 0 by the hypothesis that ye a, COnverges 
n=! 
absolutely, using the second part of Theorem 5. 
(a) Prove that if DS a, converges absolutely, and {b,} is any subsequence of 


n=] 


OO 
{ay}, then we converges (absolutely). 


n=1| 
(oe 


(b) Show that this is false if Soa, does not converge absolutely. 


n=] 
00 


*(c) Prove that if NE ai, converges absolutely, then 


n=l 
Oo 
Ps, = (aq) ta3+a5s+---)+(arat+ag+ag+-:--). 
n=1 
Prove that if Sian is absolutely convergent, then > ay = De lay|. 
n=] ll ne 


. 6 "OO, 
Problem 19-43 shows that the improper integral fy (sin.x)/x dx converges. 
7OO . . 
Prove that f, |(sin.x)/x|dx diverges. 


Find a continuous function f with f(«) > 0 for all x such that Ss {Ge dx 
exists, but hni f(v) does not exist. 
BS ee ©. 2) 


FEGURE 5 


#19. 


205 


FZ 1, 
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Let f(x) =x sin | /xstor 0 = x =< |, andelet 40) = 0. Recalisthe denmtion 
of (C7, P) irom Problem "3-25. show that tlegset o: all’ (( faaator Fa 
partition of [0,1] is not bounded (thus f has “infinite length”). Hint: ‘Try 
partitions of the form 


I 9D 


Toot ae o Pa 


2 2 
P = 40, ————_ —,1 
(Qn. pal )a a 
es: a 
Let f be the function shown in Figure 5. Find f, f, and also the area of the 
shaded region in Figure 5. 


In this problem we will establish the “bimomial series” 
fe, ©) 
(tx)7=>> Cs Feo 
k—O k 


for any a, by showing that lm R,9(x) = 0. The proof is in several steps, 
n> CO 
and uses the Cauchy and Lagrange forms as found in Problem 20-21. 
~ (a 
(a) Use the ratio test to show that the series Ss ({)r! does indeed converge 
k=0 
for |r| < 1 (this is not to say that it necessarily converges to (1 + r)%). It 
, : : : a 
follows in particular that lim ( yr Alito alpee IE 
n—& il 
(b) Suppose first that 0 < x < 1. Show that lim R,9(+) = 0, by using 
n—>©oO 7 


Lagrange’s form of the remainder, noticing that (1 + rje-2! <= IT for 
n+l>a. 
(c) Now suppose that —1 < x < 0; the number t m Cauchy’s form of the 


remainder satisfies —1 < x < 1+ < 0. Show that 


(oot Pe)e | = el, where M = max(1, (1 + .x)*~!), 
and 
MoS it 


l—t/x 
= {53 | |= |= iba 
l+1 1+ 


Using Cauchy’s form of the remainder, and the fact that 


I( at yy ae 
3b n+l ory 


show that hm R,.o9(x) = 0. 


N— CO 


(a) Suppose that the partial sums of the sequence {a,} are bounded and that 


ZS 
{b,} is a sequence with b, > by; and lim b, = 0. Prove that ) aD. 
i> © 
j=l 


converges. ‘This is known as Durchlet’s test. Hmt: Use Abel’s Lemma 
(Problem 19-36) to check the Cauchy criterion. 
(b) Derive Leibniz’s Theorem from this result. 


496 Infinite Sequences and Infinite Series 


Coe 


bgp 


220. 


#26. 


27. 


OO 
(c) Prove, using Problem 15-33, that the series ) (cosnx)/n converges if x 


n=] 
is not of the form 2kz for any integer k (in which case it clearly diverges). 
OO 
(d) Prove Abel’s test: If ) a, converges and {by} is a sequence which is either 


n=] 
© 


nondecreasing or nonincreasing and which is bounded, then ) O,Dy 


n=] 


converges. Hint: Consider b, — b, where b = hm Dy. 
: n—0o 


lo, @) 


Suppose {a,} 1s decreasing and lim a, = 0. Prove that if ay converges, 
n> Oo 
1 


a= 
[o,@) 


then ) 2" aq also converges (the “Cauchy Condensation Theorem”). No- 


n=\ 


tice that the divergence of ) [/n is a special case, for if ) 1 /n converged, 


n=l n=l 
OO 
then ) 2"(1/2") would also converge; this remark may serve as a hint. 
ri 
OO ©) CO 
2 2 
(a) Prove that if ) i. ald ) b,” converge, then ) dnb, converges. 
rll i=! n= 
OO lee) 
Os a 
(b) Prove that if ) ay” converges, then ) a,/n® converges for any @ > a 
n=1 n=1 
CO 
Suppose {a,,} is decreasing and each a, > 0. Prove that if a, converges, 
te oD 3 
n=1 
then hm na, = 0. Hint: Write down the Cauchy criterion and be sure to 
{fi 10) 


use the fact that {a,} 1s decreasing. 


[o @) 
If ) a, converges, then the partial sums s,, are bounded, and hm a, = 0. 


Nl CO 
n=1 
It is tempting to conjecture that boundedness of the partial sums, together 
CO 
with the condition hm a, = 0, imphes convergence of the series ) Gin 
N00 
n=l 


Find a counterexample to show that this is nof true. Hint: Notice that some 
subsequence of the partial sums will have to converge; you must somehow allow 
this to happen, without letting the sequence of partial sums itself converge. 
ore © 
Prove that ifa, > 0 and > a, diverges, then » 
n=] n= 


Compare the partial sums. Does the converse hold? 


ay . e 
—— also diverges. Hint: 
+ Ay 
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oo 


28. For b, > O we say that the infinite product I] b,, converges if the sequence 


n=] 


converges, and also lim p, 4 0. 
no 


Pr = I] b; 


i! 


fe ©) 
(a) Prove that if I] b, converges, then b, approaches 1. 
n=1 
(b) Prove that I] b, converges if and only if ) log b, converges. 
n=1 


n=1 


oo ~~ 
(c) For a, = 0, prove that ] [a + a,) converges if and only if ) a, con- 
n=1 


n=! 


verges. Hint: Use Problem 27 for one implication, and a simple estimate 


for log(1 + a) for the reverse implication. 
The remaining parts of this Problem show that the hypothesis a, > 0 is 


needed, 
Use the Taylor series for log(1 + x) to show that for sufficiently small x 
2 


we have 
1<y-lo (1+<x)< 3y 
q* = S A] SSG? 
oS 


Conclude that if all a, > —1 and ) a, converges, then the series 


n=1 


n=! 


x~ oO 
. “c 9 ~e . 
) log(1 + a,) converges if and only if ) a, converges. Similarly, 


n=! 


n=] 
gs 9 ) . 
if all a, > —1 and ) a,” converges, then ) log(1 + a,) converges if 
n=1 
>, 3) 
and only if ) a, converges. Hint: Use the Cauchy criterion. 


n=l 


(e) Show that 
/n 


converges, but 


diverges. 
Consider the sequence 
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29. 


30. 


oO 


(compare Problem 26). Show that ) a, diverges, but 


n=1 
oo 
| [dG +4) =1. 
n=l 
~ | 
(a) Compute I] (1 — =). 


5 


n= 


© 
(b) Compute [ [a + x°") for Ix| < 1. 
n=| 
o. 
The divergence of De I/n is related to the following remarkable fact: Any 
n=] 
positive rational number x can be written as a finite sum of distinct numbers 
of the form 1/n. The idea of the proof is shown by the following calculation 
for a: Since 


ot) kb 23 
31 2: — 62 
a Sm 
a 3 Tes 
Ene | aby 
186 + 49°" 36 
Si ee oe 
iso 2 «Ie 
we have 
27 | l l | 
25 ee a a GU C02 © 


Notice that the numerators 23, 7, 1 of the diflerences are decreasing. 


(a) Prove that if 1/(n+1) < x < I/n for some n, then the numerator in this 
sort of calculation must always decrease; conclude that x can be written 


as a finite sum of distinct numbers 1/k. 
oo 


(b) Now prove the result for all x by using the divergence of ) L/n. 


n=1 


FIGURE t 


fi 


CHAPTER 


fa 


UNIFORM CONVERGENCE AND 
POWER SERIES 


The considerations at the end of the previous chapter suggest an entirely new way 
of looking at infinite series. Our attention will shift from particular infinite sums 
to equations like 


*— | x a 
eg = + ot 


which concern sums of quantities that depend on x. In other words, we are 
interested in functions defined by equations of the form 


FO SO) + y+ 


(in the previous example f(x) = x"7!/(n — 1)!). In such a situation {f,} will be 
some sequence of functions; for each x we obtain a sequence of numbers { f,(+)}, 
and f(x) is the sum of this sequence. In order to analyze such functions it will 
certainly be necessary to remember that each sum 


fix) + folx) + fax) +--> 


is, by definition, the limit of the sequence 


OC! ce ewe Ca) Ge 2 FE P30), mmc ; 
If we define a new sequence of functions {s,,} by 
Sp = fp +o: ae 
then we can express this fact more succinctly by writing 


Fix) = lim o(: 


n—-Co 


For some time we shall therefore concentrate on functions defined as limits. 
ney Alin) F(X) 
li-—> CO 


rather than on functions defined as infinite sums. The total body of results about 
such functions can be summed up very easily: nothing one would hope to be 
true actually is—instead we have a splendid collection of counterexamples. The 
first of these shows that even if each f, is continuous, the function f may not be! 
Contrary to what you may expect, the functions f;, will be very simple. Figure | 
shows the graphs of the functions 


if a= 


— 


IV IA 
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These functions are all continuous, but the function f(x) = lim f,(x) is not 
n— 00 


continuous; in fact, 


C8) 
l, « 


be || 


hinge) — | 
ti —> GX) 


IV IA 


Another example of this same phenomenon is illustrated in Figure 2; the func- 
tions f, are defined by 


l 
= ae < — 
n 
FIGURE 2 I 
ufial( Xa) = Se peg gee 
n n 
l 
le —<x 
Hn 


In this case, if x < 0, then f,,(%) is eventually (1.e., for large enough 7) equal to —1, 
and if x > 0, then f,(x) is eventually 1, while /,(0) = 0 for all ». Thus 


—-!l, x <0 
lity fe Cee > 10: ro) 
nds 1, xO: 


so, once again, the function f(x) = hm f,(x) 1s not continuous. 
n— Oo 


By rounding off the corners in the previous examples it 1s even possible to 
FIGURE 3 produce a sequence of differentiable functions {f,} for which the function f(x) = 


lim f(x) is not continuous. One such sequence is easy to define explicitly: 
1N—> 00 


| 
=| x<-- 
H 
Ae sin (“*) OF ee 
i 2 ; ia ee th 
| 
lr — <x 
a 


These functions are differentiable (Figure 3), but we still have 


—l, x <0O 
him fn(x) = 4 0, : = () 
a Ile ie U: 


Continuity and diflerentiability are, moreover, not the only properties for which 
problems arise. Another difficulty is illustrated by the sequence {f,} shown m 
Figure 4; on the interval [0,1/n] the graph of f, forms an isosceles triangle of 
altitude vn, while f, (4) = 0 for x > 1/n. These functions may be defined explicitly 
HIGURE 4 as follows: 


Halu RSS 


FIGURE 6 
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| 
Pn2x, O<x< — 
2n 
F | ] 
Lit i= }, JHEA2h sk, = ee 
2n n 
l 
0, — Sie= |), 
n 


Because this sequence varies so erratically near 0, our primitive mathematical 


instincts might suggest that lim f(x) does not always exist. Nevertheless, this 
n> ow 


limit does exist for all x, and the function f(x) = lim f,(x) is even continuous. 
Tf fame AO. 


In fact, if x > 0, then f,,(x) is eventually 0,so lim f,,(x) = 0; moreover, f,(0) = 0 


NCO 
for all n, so that we certainly have lim f,(0) = 0. In other words, f(x) = 
n— 
lim f(x) = O for all x. On the other hand, the integral quickly reveals the 
I> OO 


strange behavior of this sequence: we have 


| 
| flr) die= os, 
0 
but , 
/ (Coes 0), 
0 
Thus, 


N—> OO 


l | 
im f SEG Aye ke z | lim, Oadiads. 
0 Sy Q 2-0 


This particular sequence of functions behaves in a way that we really never 
imagined when we first considered functions defined by limits. Although it is true 
that 


i) = Md for each x imi [O) hy, 
n—>O& 


the graphs of the functions f, do not “approach” the graph of f in the sense of 
lying close to 1it—if, as in Figure 5, we draw a strip around f of total width 2e (al- 
lowing a width of € above and below), then the graphs of f, do not he completely 
within this strip, no matter how large an n we choose. Of course, for each x there 
is some N such that the point (x. f,(«)) hes in this strip for n > N; this assertion 
Just amounts to the fact that im n(x) = f(x). But it is necessary to choose larger 
and larger N’s as x is chosen closer and closer to 0, and no one N will work for 
all x at once. 

The same situation actually occurs, though less blatantly, for each of the other 
examples given previously. Figure 6 illustrates this point for the sequence 


ye AG 


Ge 3 
me ae | rs 


; I 


A strip of total width 2¢ has been drawn around the graph of f(x) = lim f, (+). 
NO 


If ¢ < 4, this strip consists of two pieces, which contain no points with second 
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GU Rey 


coordinate equal to 4; since each function f, takes on the value }, the graph of 


each f;, fails to he within this strip. Once again, for each point x there is some N 
such that (x. f,(¥)) lies in the strip for n > N; but it is not possible to pick one N 
which works for all x at once. 

It is easy to check that precisely the same situation occurs for each of the other 
examples. In each case we have a function f, and a sequence of functions { f,}, 
all defined on some set A, such that 


i lim ee) for all x in A. 


‘This means that 


for all ¢ > O, and for all x in A, there is some N such that if n > N, then 


I f(x) = Tin(X)| << a, 


But in each case different N’s must be chosen for different x’s, and in each case it 
is not true that 


for alle > O there is some N such that for all x in A, if m >"8N, then 


| yg) at) eee 


Although this condition differs from the first only by a minor displacement of the 
phrase “for all x in A,” it has a totally different significance. Ifa sequence { fi} 
satisfies this second condition, then the graphs of f, eventually he close to the 
graph of f, as illustrated in Figure 7. ‘This condition turns out to be just the one 
which makes the study of limit functions feasible. 


DEFINITION Let {f,} be a sequence of functions defined on A, and let f be a function which 
is also defined on A. Then f is called the uniform limit of {/f,} on A if for 
every € > 0 there is some N such that for all x in A, 


mes NV, then | f@) — Fal =e. 


We also say that { f,} converges uniformly to f onA, or that f,, approaches 
f uniformly on A. 


As a contrast to this definition, if we know only that 


(COS Re) fog.cach x mA, 


then we say that { f,} converges pointwise to f on A. Clearly, uniform conver- 
gence imphes pointwise convergence (but not conversely!). 

Evidence for the usefulness of uniform convergence is not at all difficult to amass. 
Integrals represent a particularly easy topic; Figure 7 makes it almost obvious that 
if {fn} converges uniformly to f, then the integral of f, can be made as close 
to the integral of f as desired. Expressed more precisely, we have the following 


theorem. 


THEOREM 1 


PROOF 


THEOREM 2 


PROOF 
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Suppose that {f,} is a sequence of functions which are integrable on [a,b], and 
that {f,} converges uniformly on [a,b] to a function f which is integrable on 
asda). when 


fl if = him fe 
i= OO 
Let e > 0. There is some N such that for all 2 > N we have 


Oa fi) = for all x in [a, bd]. 


Thus, if 2 > N we have 
b b 


J eda — eos 


a a 


b 
/ Ff On =n) ax 


a 


b 
aS i If x) — frlx)| dx 


a 


b 
< il Edx 
a 


= e(b—a). 


Since this is true for any e€ > 0, it follows that 
b 


N&O 


b 
f = lim fr. 4 
a 


a 


The treatment of continuity is only a little bit more difficult, involving an 
“¢/3-argument,” a three-step estimate of | f(x) — f(w +/)|. If (f,} 1s a sequence 
of continuous functions which converges uniformly to f, then there is some n such 
that 


GQ) If@) = fall < = 


is) 


(2) Oe Os tat) 


es) 


Moreover, since f, 1s continuous, for sufficiently small 4 we have 


GB) frie) = fue FA < 5. 


It will follow from (1), (2), and (3) that | f(x) — f(x +/)| < e. In order to obtain (3), 
however, we must restrict the size of |A| in a way that cannot be predicted until n 
has already been chosen; it is therefore quite essential that there be some fixed n 
which makes (2) true, no matter how small |h| may be—it is precisely at this poit 
that uniform convergence enters the proof. 


Suppose that {f,} 1s a sequence of functions which are continuous on [a,b], and 
that {f,} converges uniformly on [a,b] to f. Then f is also continuous on [a, b]. 


For each x in [a,b] we must prove that f is continuous at x. We will deal only 
with x mn (a,b); the cases x = a and x = b require the usual simple modifications. 
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Let ¢ > 0. Since {f,} converges uniformly to f on [a, b], there is some n such 
that 


f(y) — fr) < 5 for all y in [a, b]. 


In particular, for all A such that x + h is in [a,b], we have 
& 
3 
(2) f(x +h) — file +h)| < 


(1) If(x) — fn(®)| < 
€ 
3 


Now fy is continuous, so there is some 6 > 0 such that for |h| < 5 we have 


5 k E 
(3) lfn(x) — fr(® +h)| < 3 


Thus, if || < 6, then 
|\f(x+h)— f(x)| 
= lf (x 7 h) = Tn (& + h) =i Y £6 Ge so h) = Fax) i fr) 7 F(x)| 
lf@ +A) — f@+h)| + li@ +4) — fp.) + lAG@)—fe)| 


lA 


een 
aor 
=e. 


This proves that f is continuous at x. J 


FIGURE 8 


After the two noteworthy successes provided by Theorem 1 and Theorem 2, 
the situation for diflerentiability turns out to be very disappointing. If each f, is 
differentiable, and if {f,} converges uniformly to f, it is still not necessarily true 
that f is diflerentiable. For example, Figure 8 shows that there is a sequence of 
differentiable functions {f,} which converges uniformly to the function f(x) = |x|. 


THEOREM 3 


PROOF 
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Even if f zs differentiable, it may not be true that 
cor lime f, x): 
noo 
this is not at all surprising if we reflect that a smooth function can be approximated 


by very rapidly oscillating functions. For example (Figure 9), if 


ees 
ee) ae SIT) 
n 
then { f,} converges uniformly to the function f(x) = 0, but 


fal (x) = ncos(n7x), 


and lim ncos(n?x) does not always exist (for example, it does not exist if x = 0). 
n—0oo 


1 


Sn 


AAAANANAAAAAAAAAAMAAAAAAAAAAASAMMULLAAANAAAAMUMUULLAAAAAAAA 
VV 


FIGURE 9 = 


Despite such examples, the Fundamental Theorem of Calculus practically guar- 
antees that some sort of theorem about derivatives will be a consequence of Theo- 
rem |; the crucial hypothesis is that { f,’} converges uniformly (to some continuous 
function). 


Suppose that {f/,} 1s a sequence of functions which are differentiable on [a, b], 
with integrable derivatives /f,’, and that { f,,} converges (pointwise) to f. Suppose. 
moreover, that { f,'} converges uniformly on [a, b] to some continuous function g. 


Then f is differentiable and 
Veo — lin i (EOL 
n—>>CoO 


Applying Theorem | to the interval [a, x], we see that for each x we have 


= him [frQ) — fala] 
=m} (yj (a): 
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Since g is continuous, it follows that f’(x) = g(x) = lim f,’(x) for all x in the 
Li eS 
interval [a,b]. J 


Now that the basic facts about uniform limits have been established, it is clear 
how to treat functions defined as infinite sums, 


Pi ofa) + folx) ta $300) Fe -> 
This equation means that 
f(x) = him fi) +++ + fad: 
our previous theorems apply when the new sequence 


ee ek fat ek a 


converges uniformly to f. Since this is the only case we shall ever be interested 
in, we single it out with a definition. 


oC 


DEFINITION ‘The series fn converges uniformly (more formally: the sequence { f,,} is 


n=1 
uniformly summable) to f on A, if the sequence 


ees J. fi + Yoke 


converges uniformly to f on A. 


We can now apply each of Theorems 1, 2, and 3 to uniformly convergent series; 
the results may be stated in one common corollary. 


OO 
COROLLARY Let 2 fn converge uniformly to f on [a, b]. 
i=l 
(1) Ifeach f, is continuous on [a,b], then f is continuous on [a, bd]. 
(2) If f and each f, ts integrable on [a, b], then 


b Co pb 
= 
a heel a 
CO 
Moreover, if Ss fn converges (pointwise) to f on [a,b], each f, has an integrable 


n=] 
eo) 


derivative f,,’ and ) fn converges uniformly on [a,b] to some continuous fune- 


n=] 
tion, then 


Ce Co pate for all x in [a,b]. 


n=] 
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PROOF — (1) Ifeach f,, ts continuous, then so is each f} +--+ fp, and f is the uniform limit 
of the sequence fi, fi + fo, fit fo + fz, ..., 80 f 1s continuous by Theorem 2. 


(2) Since fi. fit fo, fit fot fs, ... converges uniformly to f, it follows from 
Theorem | that 


b b 
[ faim [itt i 


b b 
= Jim ( fit f fn) 
n— 00 £ , 


= is 


a 
n=] 


(3) Each function fj +---+ fy is differentiable, with derivative fj’ +---+ fh’, 
and fi’, fi’+ fo’, fi’ + fo’ + f3’, ... converges uniformly to a continuous function, 
by hypothesis. It follows from Theorem 3 that 


fx) = lim [fil) #2 + fr] 


= ‘5 fo: | 


n=1 


At the moment this corollary 1s not very useful, smece it seems quite difficult to 
predict when the sequence fi, fi+f2, fitfot+f,... will converge uniformly. The 
most important condition which ensures such uniform convergence 1s provided by 
the following theorem; the proof is almost a triviality because of the cleverness 
with which the very simple hypotheses have been chosen. 


THEOREM 4 Let {fr} be a sequence of functions defined on A, and suppose that {M,,} is a 
(THE WEIERSTRASS M-TEST) — sequence of numbers such that 


Wce)| UV for allay mA. 


Suppose moreover that ) M,, converges. Then for each x in A the series ) eS) 


= nl 
O° 


converges (in fact, it converges absolutely), and ) fn converges untformly on A 
heal 
to the function 


0° 


i) ye: 


A= 
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co 
PROOF _ For each x in A the series om | fn(x)| converges, by the comparison test; conse- 


n=l 
Lo, 2) 


quently > fn(x) converges (absolutely). Moreover, for all x in A we have 


n=1 


YS fn) 


n=N+1 


y | fn(x)| 


n=N+1 


© 
YS Mn. 


n=N+1 


| fx) —[fiv) +---+ fv@)] | = 


lA 


lA 


ie 4) Lo, ©) 
Since ) M,, converges, the number ) M,, can be made as small as desired, 
n=l n=N+1 


by choosing N sufficiently large. J 


f(x) = {x} 


filx) = ${2x} FIGURE 10 


The following sequence {f,} illustrates a simple application of the Weierstrass 
M-test. Let {x} denote the distance from x to the nearest integer (the graph of 
(a) f (x) = {x} is illustrated in Figure 10). Now define 


l 
fn) = Jor 110"! 
The functions f; and f2 are shown in Figure 11 (but to make the drawings simpler, 
10” has been replaced by 2”). ‘This sequence of functions has been defined so that 
the Weierstrass M-test automatically applies: clearly 


] 
| fn(x)| < 10" for all x, 


FIGURE II 


THEOREM 5 


PROOF 
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9° © 
and ) 1/10" converges. Thus ) fn converges uniformly; since each f, 1s con- 


il n=] 
tmuous, the corollary imphes that the function 


(o.@) 


fe) = fey = Ho's) 


n=! n=1 


is also conunuous. Figure 12 shows the graph of the first few partial sums fy + 


-++ + fn. As nm increases, the graphs become harder and harder to draw, and 
OO 


the infinite sum ) fn IS quite undrawable, as shown by the following theorem 


ill 
(included mainly as an interesting sidelight, to be skipped if you find the going too 


rough). 


The function 
ik 


f= Des pr {10's} 


n=1 


is continuous everywhere and differentiable nowhere! 


We have just shown that f is continuous; this is the only part of the proof which 
uses uniform convergence. We will prove that f is not differentiable at a, for 
any a, by the straightforward method of exhibiting a particular sequence {h,,} 
approaching 0 for which 

. J Catan) et @ 

hm 


moo Fey, 
does not exist. It obviously suffices to consider only those numbers a satisfying 
O<a<l. 


Suppose that the decimal expansion of a is 
(a= 00 00030m... 
et 1, = wlOe” wf Go 4oore9, butlet hg = —1LOp” ihane—eb or 9 (the reasom for 
these two exceptions will appear soon). ‘Then 


[e 6) 


fla ee /e,) = f(a) _ 3 


h m 


1 {10"(a + hy} — (10"a} 
10" +10-" 


n= 


=) > £10" "[{10"(a + fim)} — (10"a}]. 


n=] 
This infinite series 1s really a finite sum, because if n > m, then 10”"/,, is an integer, 
Se) 
{10"(a +h,,)} ~ {10"a} = 0. 


On the other hand, for n < m we can write 


10%a. = mteger + 0.4, 16n4.2G)43 . .- Gn - 
bO"(Get tig) = itevern--0.a, aya. (Gye ),.. . 
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(in order for the second equation to be true it 1s essential that we choose hy, = 
—10-" when a, = 9). Now suppose that 


tol — 


ONG 142043 Per. aa 
Then we also have 


1 
0.4 yn 4.10n4+24n+43 Beet ae) aoe 7 


(m the special case m = n+ 1 the second equation is true because we chose 
1 nF ie hn = —10-" when a,, = 4). This means that 
4 
fi Wes, = (Okay eet aa 
and exactly the same equation can be derived when 0.4) 414)424n43-.. > 5. ‘Thus, 
// for n < mn we have 


10"-"[{10"(a + hin)} — (10"a}] = 41. 


i 
(a) : 


In other words, 

flathay— f(e® 

hm 

is the sum of m — 1 numbers, each of which is +1. Now adding +1 or —I toa 
number changes it from odd to even, and vice versa. The sum of m — 1 numbers 
each +1 is therefore an even integer if m is odd, and an odd integer if m is even. 
Consequently the sequence of ratios 

fla == in) a fla) 


i, 


cannot possibly converge, since it is a sequence of integers which are alternately 
odd and even. J 


=a 
OF 
a 
Ni 


In addition to its role in the previous theorem, the Weierstrass M-test is an ideal 
tool for analyzing functions which are very well behaved. We will give special 
attention to functions of the form 

(o3) 
f= Dan Gx =a)”, 


n=0 


which can also be described by the equation 


.eoe= s (COP 


n=0 


Ni— 


FIGURE 12 for fr(x) = a,(x — a)". Such an mfinite sum, of functions which depend only 
on powers of (x — a), 1s called a power series centered at a. for the sake of 


simplicity, we will usually concentrate on power series centered at 0, 


oN) 


ix) = ara 


n=0 


THEOREM 6 
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One especially important group of power series are those of the form 


2° f™@M 


(x: — Gy". 
n! 


n=0 
where f is some function which has derivatives of all orders at a; this series is 
called the Taylor series for f at a. Of course, it is not necessarily true that 


0 (Nn), 
. (a) 
ha ) FO ea)" 
n=0 ; 


this equation holds only when the remainder terms satisfy lim Rp.@(x) = 0. 
n—->oX 
oO 


We already know that a power series ) a,x" does not necessarily converge for 


n=0 
all x. For example, the power series 
ro ar x) x! i 
x _ — ———— eee 
3 5 7 
converges only for |x| < 1, while the power series 
, 
‘= Y : ne 7 x ie 
x-—~—+—-—+— 
2 3 4 5 


converges only for —1 <x < 1. It is even possible to produce a power series which 
converges only for x = 0. For example, the power series 
x 


) nix" 


n=0 


does not converge for x 4 0; indeed, the ratios 


(n+ 1)! (x"*!) 
———_—— =(n+1)x 
n! x” 
°° 
are unbounded for any x 4 0. If a power series ) a,x" does converge for 


n=0 
x 


some x9 4 0 however, then a great deal can be said about the series ) a,x" for 


n=0 
|x| < |xol. 


Suppose that the series 
2, 3) 


Ff (xo) = ye anXo" 


n=0 


converges, and let a be any number with 0 < a < |xo|. Then on |—a,a] the series 


~~ 


1 E 9 Sax” 


n=(0 
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converges uniformly (and absolutely). Moreover, the same is true for the series 


2= > na,x” 
n=l 
Finally, f is differentiable and 
co 
+ ji 3 — = nanx" | 
n=1 
for all x with |x| < |xo]. 
SO 
PROOF Since Yo dnxo” converges, the terms a,x9" approach 0. Hence they are surely 
n=0 


bounded: there is some number M such that 
lanxo"| = |an| - |[xo"| < M for all n. 
Now if x is in [—a, a], then |x| < Ja], so 


lanx"| = |an| - [x"| 


< lay, | 3 la” | 
n 


(this is the clever step) 


a 
= |a,| - xo!” : | aa 


n 
a 


XQ 


<M 


But |a/xo| < 1, so the (geometric) series 


YM 


n=0 


n n 


a a 


Xo Xo 


oO 
—M > 
n=0 


converges. Choosing M - |a/xo|" as the number M,, in the Weierstrass M-test, it 


follows that ) a,x" converges uniformly on [—a, a]. 


n=0 
x 


To prove the same assertion for g(x) = ) na,x" | notice that 
n=! 


Ina,x" | =nla_|- |x" | 


<nap| - |a"~"| 
lan| _ yn : 
am = [xol st |: = 
la| x0 
M a-|" 
Sn | 
lal | Xo 
Since |a/xo| < 1, the series 
SO M a n M o¢ a n 
Ra ee ee ee 
la| x0 lal x0 
n= n=] 


converges (this fact was proved in Chapter 23 as an application of the ratio test). 
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‘ 
Another appeal to the Weierstrass M-test proves that ) na,x"—' converges uni- 


n=] 
formly on [—a, a]. 


Finally, our corollary proves, first that g 1s continuous, and then that 


(CLS aE gan” for x in [—a, a]. 


i 


Since we could have chosen any number a with 0 < a < |xo|, this result holds for 
all x with |x| < |xo|. 


We are now in a position to manipulate power sertes with ease. Most algebraic 
manipulations are fairly straightforward consequences of general theor ems about 


CO 
infinite series. For example, suppose that f(x) = ais and ¢(x%) = oe . 
n=0 n=0 


where the two power series both converge for some xo. ‘Then for |x| < |xo| we 
have 


fore) oo fo6) CO 
De aie oF Daal! = igo: a Dax ) = iG =e Dice 
n=0 n=0 n=0 n=0 


So the:series A(x) = we: + b,)x" also converges for |x 
n=O 


,andh=f+g 


for these x. 
The treatment of pica Is OS a httle more involved. If |x| < |xo|, then we 


know that the series er came nara Ss b,x" converge absolutely. So it follows from 


n=0 n=0 
(o.@) oo 
Theorem 23-9 that the product y ae . b,x" is given by 
n=0 n=0 
~ OO 
b> ye ajx' bjx!, 
ei 0 


where the elements a;x'bjx/ are arranged in any order. In particular, we can 
choose the arrangement 


agbo + (agb, + a,bo)x + (agb2 + ayb, + aobo)x? ve 


which can be written as 
ow 


H 
) Cpe Oe ey = ) arDy—k. 


n=0 k=0 


This is the “Cauchy product” that was introduced in Problem 23-10. Thus, the 
CO 


Cauchy product A(x) = ) Gi) also couwercescion |) <2 vq) ancl: =") estar 
n=O 
phiese-v. 
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a 
Finally, suppose that f(x) = ) anx", where ag ~ 0, so that f(0) = ag ¥ 0. 
n=0 
>, ©) 
Then we can try to find a power series ) b,x" which represents 1/f. This means 
n=0 


that we want to have 


5 
yan” do bax" =1=1+0-x+0-x-+4+---. 
n=0 n—0 


Since the left side of this equation will be given by the Cauchy product, we want 


to have 


agbo = | 
apb; + ajbop = 0 
agb2 + a,b, + arb = 0 


Since ag # 0, we can solve the first of these equations for bg. Then we can solve 
(© @) 


the second for b;, etc. Of course, we still have to prove that the new series a Bax" 
n=0 
does converge for some x 4 0. This is left as an exercise (Problem 18). 
For derivatives, Theorem 6 gives us all the information we need. In particular, 
when we apply Theorem 6 to the infinite series 


es j ; a x x! x? 
Lie 25 eo 
eae, yo a ee ee 
COSX = = 31 (pgp ear alee 


we get precisely the results which are expected. Each of these converges for any xo, 
hence the conclusions of Theorem 6 apply for any x: 


— ' Sy* 9 5a ae 
snxv)— "ag dome pee te eens 
ele 2% Ax? 6x° rene 
cos (x) = — oT + a — ee es =-—smx, 
2x 3x? eS 
exp (x)= 1 + TI -} Sr ++ +2 = exp(x). 


For the functions arctan and f(x) = log(1 +x) the situation is only slightly more 
complicated. Since the series 
ro) x? 


3 
“3 
arctanx =x ——— +——— +: 
3 5 
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converges for xg = |, it also converges for |x| < 1, and 


| 
5 j 
arétan (Whe ee eee forint <1. 
ees 
In this case, the series happens to converge for x = —1 also. However, the formula 
for the derivative is not correct for x = | or x = —1]; indeed the series 
a 
lee ee 
diverges for x = | and x = —1. Notice that this does not contradict Theorem 6, 
which proves that the derivative is given by the expected formula only for |x| < |xol- 
Since the series 
o: Z 
log(1 + x) ae Ned 
og oe) = = ——— SS oe 
2 3 4 5 
converges for x9 = I, it also converges for |x| < 1, and 
I 2s 
=log(l+x)=1l—x+x°-x°4--- for [xl 
l+.x 
In this case, the original series does not converge for x = —1; moreover, the 


differentiated series does not converge for x = 1. 
All the considerations which apply to a power series will automatically apply to 
its derivative, at the points where the derivative is represented by a power series. 


If 


(@ 8) 


f(x) = are 


n=0 


converges for all x in some interval (—R, R), then Theorem 6 implies that 


oO 


(MES) = Yeas 
n=l 
for all x in (—R, R). Applying Theorem 6 once again we find that 


OO 


of Gs) = Yinin = Dak. 


n=2 


and proceeding by induction we find that 
ie, 2) 
fF") = Yi n(n =f). eee eee 
n=k 
Thus, a function defined by a power series which converges in some interval 
(—R, R) is automatically infinitely differentiable in that interval. Moreover, the 
previous equation implies that 


f® (0) = kia, 
so that 
f™ (0) 
ki 


1 
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In other words, a convergent power series centered at 0 is always the Taylor series at 0 of the 
function which it defines. 


On this happy note we could easily end our study of power series and ‘Taylor 
series. A careful assessment of our situation will reveal some unexplained facts, 
however. 

The ‘Taylor series of sin, cos, and exp are as satisfactory as we could desire; 
they converge for all x, and can be diflerentiated term-by-term for all x. The 
‘Taylor series of the function f(x+) = log(1 + x) is slightly less pleasing, because it 
converges only for —1 < x < 1, but this deficiency 1s a necessary consequence of 
the basic nature of power series. If the ‘Taylor series for f converged for any xo 
with |xo| > 1, then it would converge on the interval (—|xo|, |xo|), and on this 
interval the function which it defines would be differentiable, and thus continuous. 
But this is impossible, since it is unbounded on the interval (—1. 1), where it equals 
log(1 +x). 


The ‘Taylor series for arctan is more difficult to comprehend— there seems to 


be no possible excuse for the refusal of this series to converge when |x| > 1. This 
mysterious behavior is exemplified even more strikingly by the function f(x) = 
1/d + x), an infinitely differentiable function which is the next best thing to a 
polynomial function. ‘The ‘Taylor series of f 1s given by 


] 
oy Tae? = | — x24 gt? — 


If |x| => 1 the Taylor series does not converge at all. Why? What unseen obstacle 
prevents the ‘Taylor series from extending past 1 and —1? Asking this sort of 
question is always dangerous, since we may have to settle for an unsympathetic 
answer: it happens because it happens— that’s the way things are! In this case 
there does happen to be an explanation, but this explanation is impossible to give 
at the present time; although the question is about real numbers, it can be answered 
intelligently only when placed in a broader context. It will therefore be necessary 
to devote two chapters to quite new material before completing our discussion of 
Taylor series in Chapter 27. 


PROBLEMS 


1. For each of the following sequences {f,}, determine the pointwise limit of 
{fn} (if it exists) on the indicated interval, and decide whether { f,} converges 
uniformly to this function. 


fi) fae) = oe econ (0. IN) 


- . 0, x= 
(11) tn Ca ar of 


n 
on fa, b], and on R. 
n, 
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(i) fil) =<, on (1, 00). 


(iv) f,(x)= enn on [—1, 1]. 


VY) £.0= =. onR 


n 


This problem asks for the same information as in Problem 1, but the functions 
are not so easy to analyze. Some hints are given at the end. 


(i)  fx(x) =x" — x" on [0, 1]. 


‘ nx 
(a) fn(x) = Line [0, 00). 


(iii) fal) = 22+ on [a,oo),a > 0. 

(iv) ful) = y[x2+ on R. 

M Koi= jx+t — x on [a, 00), a > 0. 

Wi) f(x) = jx+t — J on [0, 00). 

(vii) fx(x) =n (a = ‘) on [a, 00), a > 0. 


(vii) f,(x) =n ( + : - 4) on [0, 00) and on (0, 00). 
I 


Hints: (i) For each n, find the maximum of | f — f,| on [0, 1]. (ii) For each n, 
consider | f(x) — fn(x)| for x large. (iil) Mean Value Theorem. (iv) Give a 
separate estimate of | f(x) — f,(x)| for small |x|. (vii) Use (v). 


Find the Taylor series at 0 for each of the following functions. 


So PG ae 


x —'a 


ii) f(x) =log(x—a), a<0O. 


(iii) f(x)= =(1—x)-'. (Use Problem 20-21.) 


l—x 
] 
v1 =e 


(v) f(x) = aresin x. 


(iv) f(x)= 
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= 


Find each of the following infinite sums. 


/ pe a 
(1) WES att ay 


(ii) = = i oy? so for [x] < I. 


Hint: What is 1 —x + x? —x?+4+.-..? 


a) ee, for la eed) 


Hint: Differentiate. 
Evaluate the following infinite sums. (In most cases they are f(a) where a 
is some obvious number and f(x) is given by some power series. ‘To evalu- 


ate the various power series, manipulate them until some well-known power 
series emerge.) 


e eo (=| yn Q2n zy 2n 
y) sh (2n)! ; 
n=0 
a | 
(u) S (2n)! 
n=0 ; 
o | | 2n+l1 
ai) ae (3) 
n=0 
i eae n 
(1v) Be AR 
n=0 — 
= l 
(v) ——_——_.. 
ce seem 1) 
(vi) 2n + | 
V1 
Inyl 
n=0' n 


If f(x) = (sinx)/x for x 4 0 and f(0) = 1, find f“(0). Hint: Find the 


power series for f. 


23 
, a 
In this problem we deduce the binomial series (1 +.)* = ) ( )x" 
n 
n=0 


without all the work of Problem 23-21, although we will use a fact established 
o% 


; : a . 
in part (a) of that problem—the series f(x) = y ( x does converge for 
n 
n=O); > 


wl fereds | 


ba cell 


10. 


i. 
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(a) Prove that (1 + x) f’(x) = af (x) for |x| < 1. 

(b) Now show that any function f satisfying part (a) is of the form f(x) = 
c(1 +.x)* for some constant c, and use this fact to establish the binomial 
series. Hint: Consider g(x) = f(x)/(1 +x)*. 


Suppose mat fn are nonnegative bounded functions on A and let M, -= 


sup fr. If y fn converges uniformly on A, does it follow that ; M,, con- 


n=1 n=| 
verges (a converse to the Weierstrass M-test)? 


Prove that the series 
oO 


x 
d n(1 +nx2) 


converges uniformly on R. 


(a) Prove that the series 


2 


converges uniformly on |a, oo) for a > 0. Hint: lim(sinh)/h = 1. 
f h-O 


< . . NV 
(b) By considering the sum from N to oo for x = 2/(23%), show that the 
series does not converge uniformly on (Q, 00). 


(a) Prove that the series 


x 


i NX 
I= Dae 


n=( 


converges uniformly on [a@, 00) fora > 0. Hint: First find the maximum 
~ 9 
of nx/(1 +n4x*) on [0, 00). 


(b) Show that 


° . . - 9 
and by using an integral to estimate the sum, show that f(1/N~) > 1/4. 
Conclude that the series does not converge uniformly on R. 


(c) What about the series 
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12. (a) 


13. (a) 


Use Problem 15-33 and Abel’s Lemma (Problem 19-36) to obtain a “uni- 
form Cauchy condition”, showing that for any ¢ > 0, 


n 


sin kx 
eS, 


k=m 


can be made arbitrarily small on the whole interval [¢, 27 — e] by choos- 
ing m (and n) large enough. Conclude that the series 


COC; 
Se sim 71x 
n 


n=1 


converges uniformly on [¢, 27 — e] for € > 0. 
For x = 2/N, with N large, show that 


2N N N 
S| sinkx = y | sin kx >. 

54 
k=N k=0 


Conclude that 


3 sin kx ] 
nN k “maar, 


and that the series does not converge uniformly on [0, 27]. 


Suppose that f(x) = eS converges for all x in some interval 
n=0 

(—R, R) and that f(x) = 0 for all x in (—R, R). Prove that each a, = 0. 

(If you remember the formula for a, this is easy.) 


Suppose we know only that f(x) = 0 for some sequence {x,} with 
lim x, = 0. Prove again that each a, = 0. Hint: First show that 
n—->O©o 


f@) = a9 =; then that 7 O) = Gye atc: 


This result shows that if f(x) = el sin 1/x for x #0, then f cannot 
possibly be written as a power series. It also shows that a function defined 
by a power series cannot be 0 for x < 0 but nonzero for x > 0 —thus a 
power series cannot describe the motion of a particle which has remained 
at rest until time 0, and then begins to move! 


CO [e. 6) 
Suppose that f(x) = Se and.24)5 = bax” converge for all x 


n=0 n=0 
in some interval containing O and that f(tn) = 2(tm) for some sequence 


{tm} converging to 0. Show that a, = bn for each n. 


CO 
14. . Prove; thatif = AS eu is an even function, then a, = 0 for n odd, 


n=0 


and if f is an odd function, then a, = 0 for n even. 


15; 


*16. 


Ly. 


18. 
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Show that the power series for f(x) = log(] — x) converges only for —1 < 
x < 1, and that the power series for g(x) = log[(1 +.+)/(1 — x)] converges 
only for x in (—1, 1). 


Recall that the Fibonacei sequence {a,} is defined by @; = a2 = 1, a4) = 
Cae taal Ora 


(a) Show that ay41/a, < 2. 
(by Let 


le.) 


f@)= odes =Itxt2x743x°4---, 


n=1 


Use the ratio test to prove that f(x) converges if |x| < 1/2. 
(c) Prove that if |x| < 1/2, then 


—] 
(5 aaa 
Hint: ‘This equation can be written f(x) — xf (x) — Totes, ely 
(d) Use the partial fraction decomposition for 1/(x? +.x — 1), and the power 
series for |/(x — a), to obtain another power series for f. 
(e) Since the two power series obtained for f must be the same (they are 
both the ‘Taylor series of the function), conclude that 


eee ce 1evo\. 


2) 2 


- -_ 


Qn = 
J5 
ee (a a and g(x) = Ee Suppose we merely knew that 
n=0 n=() 
[e. Si 
7 (ee(%) = Ss CX" for some c,, but we didn’t know how to multiply series 
n=0 


in general. Use Leibniz’s formula (Problem 10-20) to show directly that this 
series for fg must indeed be the Cauchy product of the series for f and g. 


OO 


n= 


for simplicity, we'll assume that a9 = 1. Let {by} be the sequence defined 
recursively by 


bo — || 


n—-1 


b, =— 3 DLGn he 
[k=-(0) 
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% 
The aim of this problem is to show that ) b,x" also converges for some 


n=0 
x #(, so that it represents 1/f for small enough |x]. 


(a) Ifall la,xo”"| < M, show that 


n—] 
\buxo"| <M D7 |bi-xo' 
k=0 
(b) Choose M so that |a,x9"| < M, and also so that M/(M2 — 1) < 1. Show 
that 
PA | es 
oe) 
(c) Conclude that ie. converges for |x| sufficiently small. 
n=0 
ie, ®) o@) 
*19. Suppose that OL converges. We know that the series f(x) = Sa 
n=O n=0 


must converge uniformly on [—a,a] for 0 <a < 1, but it may not converge 

uniformly on [—1, 1]; in fact, it may not even converge at the point —1 

(for example, if f(x) = log(] + x)). However, a beautiful theorem of Abel 

shows that the series does converge uniformly on [0, 1]. Consequently, f is 
lo. 8) oO 

continuous on [0, 1] and, in particular, ee =r Daa Prove Abel’s 


ial 


n=0 n=0 
Theorem by noticing that if jay, +++-+4a,| < €, then |ayx!" +++++ay,x"| < 8, 
by Abel’s Lemma (Problem 19-36). 
oO 
20. A sequence {a,} 1s called Abel summable if hm ae’ exists; Prob- 
Sa 
n=0 
lem 19 shows that a summable sequence is necessarily Abel summable. Find 
a sequence which is Abel summable, but which is not summable. Hint: Look 
over the list of Taylor series until you find one which does not converge at 1, 
even though the function it represents is continuous at |. 


21. (a) Using Problem 19, find the following infinite sums. 


(o.2) CO 
(b) Let ) Cy be the Cauchy product of two convergent power series ) ai 
n=0 n=0) 


OO ie, @) 
and ) b,, and suppose merely that ) Cn converges. Prove that, in fact, 
n=() n=O 


OO fo @) 
it converges to the product ) Ay * ) Dee 
n=0 n=0 


22 


233 


gS 


2s 


26. 


27. 


(a) 
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Show that the series 


fo @) 9) 
yet yt 


2a | 2n+2 
n= 
converges uniformly to 5 log(x +1) on [—a,a] for 0 <a < 1, but that 
at | it converges to log 2. (Why doesn't this contradict Abel’s ‘Theorem 
(Problem 19)?) 


Suppose that {f,,} is a sequence of bounded (not necessarily continuous) 
functions on [a,b] which converge uniformly to f on [a,b]. Prove that 
f 1s bounded on [a, b]. 

Find a sequence of continuous functions on [a, b] which converge point- 
wise to an unbounded function on [a, 5]. 


Suppose that f is differentiable. Prove that the function f’ is the pointwise 
limit of a sequence of continuous functions. (Since we already know exam- 


ples of discontinuous derivatives, this provides another example where the 


pointwise limit of continuous functions is not continuous.) 


Find a sequence of integrable functions { f;,} which converges to the (nonin- 
tegrable) function f that is 1 on the rationals and O on the irrationals. Hint: 
Each f, will be 0 except at a few points. 


(a) 


(b) 


Prove that if f is the uniform limit of {f,} on [a,b] and each fy, is 
integrable on [a, b], then so is f. (So one of the hypotheses in Theorem |! 
was unnecessary.) 

In Theorem 3 we assumed only that { f,,} converges pointwise to f. Show 
that the remaining hypotheses ensure that { f,} actually converges uni- 
formly to f. 

Suppose that in Theorem 3 we do not assume { f,} converges to a func- 
tion f, but instead assume only that f, (x9) converges for some x9 in 
[a, b]. Show that f,, does converge (uniformly) to some f (with f’ = g). 
Prove that the series 

‘i ea” 

ipl todd 


converges uniformly on [0, 00). 


Suppose that f, are continuous functions on [0, 1] that converge uniformly 
to f. Prove that 


l-I/n ] 
lim / jJa= i ve 
Proce eh) 0 


Is this true if the convergence isn’t uniforni? 
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28. 


29. 


30. 


(a) Suppose that {f,} is a sequence of continuous functions on [a,b] which 
approaches 0 pointwise. Suppose moreover that we have fi(«) > fn41(*) 
> 0 for all n and all x in [a,b]. Prove that {f,} actually approaches 0 
uniformly on [a,b]. Hint: Suppose not, choose an appropriate sequence 
of pomts x, in [a,b], and apply the Bolzano-Weierstrass theorem. 

(b) Prove Dinr’s Theorem: If { f,} is a nonincreasing sequence of continuous 
functions on [a,b] which approaches the continuous function f point- 
wise, then {f,} also approaches f uniformly on [a,b]. (The same result 
holds if {fr} 1s a nondecreasing sequence.) 

(c) Does Dini’s Theorem hold if f isn’t continuous? How about if [a,b] is 
replaced by the open iuterval (a, b)? 


(a) Suppose that {/f,} 1s a sequence of continuous functions on [a,b] that 
converges uniformly to f. Prove that if x, approaches x, then fi,(Xn) 
approaches f(x). 

(b) Is this statement true without assuming that the /f, are continuous? 

(c) Prove the converse of part (a): If f is continuous on [a, b| and { f,} 1s 
a sequence with the property that f,(x,) approaches f(*) whenever x, 
approaches x, then f, converges uniformly to f on [a,b]. Hint: If not, 
there is an € > O anda sequence x, with | fi, in) — f (%n)| > € for infinitely 
many distinct x,. ‘Then use the Bolzano-Weierstrass theorem. 


This problem outlines a completely different approach to the integral; con- 


sequently, it is unfair to use any facts about integrals learned previously. 


(a) Let s be a step function on [a, b], so that s is constant on (f;-1.%) for 


b n 
some partition {fo,..., t} of [a,b]. Define / (is Be fa —t;-,) where 
a a 
s; 1s the (constant) value of s on (t;-),4;). Show that this definition does 
not depend on the partition {fo, .... tn}. 

(b) A function f is called a regulated function on {a, b] if it is the uniform 
limit of a sequence of step functions {s,} on [a,b]. Show that in this 
case there is, for every ¢ > 0, some N such that for m,n > N we have 
[si (x)= Suey tere for all x injab]- 


b 
(c) Show that the sequence of numbers | / | will be a Cauchy sequence. 
a 


(d) Suppose that {f,} 1s another sequence of step functions on [a@, b] which 
converges uniformly to f. Show that for every ¢ > 0 there is an N such 
Sy (Xi — ty (x) < € for x m [a, bp 


that for n > N we have 


1H 0 NCO 


b b 
(ce) Conclude that lm if 5, — aun / t,. VLhis means that we can define 
a 


a 
b 
frto be im s, for auy sequence of step fuuctions {s,} converging 
a 
uniformly to f. “Phe ouly remaiming question is: Which functious are 


regulated? 
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*(f) Prove that a continuous function is regulated. Hint: To find a step func- 
tion s on [a,b] with | f(*) — s(x)| < « for all x in [a,b], consider all y 
for which there is such a step function on [a, y|. 

(g) Every step function s has the property that lim s(x) and lim s(x) exist 


x—at x—a~ 


for all a. Conclude that every regulated function has the same property, 
and find an integrable function that is not regulated. (It is also true 
that, conversely, every function f with the property that lim f(x) and 


x—a* 
lim f(x) exist for all a is regulated.) 
x—a™ 
*31. Find a sequence {f,} approaching f uniformly on [0, 1] for which we have 
lim (length of f, on [0,1]) # length of f on [0,1]. (Length is defined in 
n> OO : ; 
Problem 13-25, but the simplest example will involve functions the length of 


whose graphs will be obvious.) 


CHAPTER COMPLEX NUMBERS 


With the exception of the last few paragraphs of the previous chapter, this book 
has presented unremitting propaganda for the real numbers. Nevertheless, the 
real numbers do have a great deficiency— not every polynomial function has a 
root. ‘The simplest and most notable example is the fact that no number x can 
satisfy x7 + 1 = 0. This deficiency is so severe that long ago mathematicians 
felt the need to “invent” a number i with the property that i? + 1 = 0. For a 
long ume the status of the “number” 7 was quite mysterious: since there 1s no 
number x satisfying x7 + | = 0, it is nonsensical to say “let i be the number 
satisfying i7 + 1 = 0.” Nevertheless, admission of the “imaginary” number i to 
the family of numbers seemed to simplify greatly many algebraic computations, 
especially when “complex numbers” a + bi (for a and b in R) were allowed, and 
all the laws of arithmetical computation enumerated in Chapter 1 were assumed 
to be valid. For example, every quadratic equation 


3” 


ax? +bx+c=0 (a 10) 
can be solved formally to give 


Shey bh? —dac 
DN C 
2a 


-_ 


7 =p —¥ b- = 4dac 


ion 
2a 


-_ 


a 


~ 9) ~ . . 
If b- —4ac => 0, these formulas give correct solutions; when complex numbers are 
allowed the formulas seem to make sense in all cases. For example, the equation 


ee | 
has no real root, smce 
a txtl=(+5)+250, forall x. 
But the formula for the roots of a quadratic equation suggest the “solutions” 


— aj29% 
a) 


eo wd). = —= 


~ 


if we understand / —3 to mean /3- (—1) = or —| = V3i. then these numbers 
would be 
* 4 | =aw3 
4 


. 


l. 


=a” and fae 
It is not hard to check that these, as yet purely formal, numbers do indeed satisfy 
the equation 


feos se || 0) 
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It is even possible to “solve” quadratic equations whose coefficients are themselves 
complex numbers. For example, the equation 


De cise te oO) 


ought to have the solutions 


Se 4) ae 3 — 
epee Tg a go ; 


where the symbol /—3 — 44 means a complex number @ + fi whose square 1s 
—3-—4i. In order te have 


(a + Bi)? =a? — B? + 2api = —3 —4i 
we need 


ae = B° = =) 
2ap = —4. 


These two equations can easily be solved for real a and B; in fact, there are two 
possible solutions: 

i a=—| 

g=-2 and oy 
Thus the two “square roots” of —3 — 47 are 1 — 2; and —1 + 2i. There is no 
reasonable way to decide which one of these should be called / —3 — 47, and which 
— /—3 —4i; the conventional usage of /x makes sense Only for real x = 0, 
which case /¥ denotes the (real) nonnegative root. For this reason, the solution 


2 te oe 


= a) 


— 


- 


must be understood as an abbreviation for: 


—Il+r 
>= —. | where r 1s one, of the square rootstof —3 — 47. 


~ 


With this understanding we arrive at the solutions 


—l+1—-2i 
CS ol 
2 
eat 2) ate 
> =—_— +1; 


as you can easily check, these numbers do provide formal solutions for the equation 
2 . 
po Be: Bieta eae — 1 
kor cubic equations complex numbers are equally useful. Every cubic equation 
3 2 
ax” +bx*+cx +d =0 (az) 


with real coefficients a, b,c, and d, has, as we know, a real root a, and if we divide 
3 » . : 
ax° +bx~+ex +d by x —@ we obtam a second-degree polynomial whose roots are 
7 ] ~ . . 
the other roots of ax3+bx*-+ex+d = 0; the roots of this second-degree polynomial 
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may be complex numbers. ‘Thus a cubic equation will have either three real roots 
or one real root and 2 complex roots. ‘The existence of the real root is guaranteed 
by our theorem that every odd degree equation has a real root, but it is not really 
necessary to appeal to this theorem (which is of no use at all if the coefficients 
are complex); m the case of a cubic equation we can, with sufficient cleverness, 
actually find a formula for all the roots. ‘The following derivation is presented not 
only as an interesting illustration of the ingenuity of early mathematicians, but as 
further evidence for the importance of complex numbers (whatever they may be). 

‘lo solve the most general cubic equation, it obviously suffices to consider only 
equations of the form 


ee 


. . . . . . 2 ye . ~ 
It is even possible to eliminate the term involving x~, by a fairly straight-forward 
mampulation. If we let 


el b 
y= y 2a 
then 
bey be 
3 2 
= y° — by* = ee 
x \ D 3 7 
2.2 2by be 
i —) 3 9° 
NTO) 
O=x2+bx7 +ex+d 
me: bs nite 2bty |b ‘s Vag 
tS fy oe = = ES pri = = OS G 
OO Ee QE 7+ 3 
noe bo we be ; ee ; 
— A S| e  WOI ct 


The right-hand side now contains no term with y?. If we can solve the equation 
for y we can find x; this shows that it suffices to consider m the first place only 
equations of the form 
tp ao! 
In the special case p = 0 we obtain the equation x? = —q. We shall see later on 
that every complex number does have a cube root, in fact it has three, so that this 
equation has three solutions. The case p 4 0, on the other hand, requires quite 
an ingenious step. Let 
p 


( SRS [oy 
) 3w 
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Then 
EE Mi ay EW tye Se 
Q=x> + px +q=(u aa) + p(w aa 
3u2p = 3 wp? p> p? 
3 
=wo— — —{ + pw-—+ 
tig ee ee ek oy 
i 
=w — Tw 1 


This equation can be written 


2 
27(w*)* + 27q(w) — p? =0, 
which is a quadratic equation in w? (!!). 


‘Thus 


, 21g + (27)?q? +4 -27p3 


DiRT 


wW--= 


Remember that this really means: 


: 
. g a pe 
w=-——4+yr, where risa square root of — i 


2 a 2 


We can therefore write 


this equation means that w is some cube root of —q/2 +r, where r is some square 
. 9 rm. . reer ed ~ 

root of g?7/4 + p3/27. This allows six possibilities for w, but when these are 

substituted into (%*), yielding 


it turns out that only 3 different values for x will be obtained! An even more 
surprising feature of this solution arises when we consider a cubic equation all of 
whose roots are real; the formula derived above may still involve complex numbers 
in an essential way. For example, the roots of 


x — 157—4=0 
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are 4, —2 + V3, and —2 — ¥3. On the other hand, the formula derived above 
(with p ==15, g = —4) gives as one solution 


a pO ee 


3 smyth 
Now, 
(ebay = 2aee oe Pe Ben eae 
=8+12i -6-i 
=2+4+1/1i 


so one of the cube roots of 2+ 1]1i is 2+7. Thus, for one solution of the equation 
we obtain 
15 
Ocpel 
Bo. | 

6+3i 6-3) 

90 — 45i 
2G 


=) See 


=~) Ee. 
= 4 (1). 


The other roots can also be found if the other cube roots of 2 + 11i are known. 
The fact that even one of these real roots is obtained from an expression which 
depends on complex numbers is impressive enough to suggest that the use of 
complex numbers cannot be entirely nonsense. As a matter of fact, the formulas 
for the solutions of the quadratic and cubic equations can be interpreted entirely 
in terms of real numbers. 

Suppose we agree, for the moment, to write all complex numbers as a + bi, 
writing the real number a as a + Of and the number 7 as 0 + Ii. The laws of 
ordinary arithmetic and the relation i? = —1 show that 


(a+ bi)+(c+di)=(a+c)+(b+d)i 
(a+ bi)- (e+ di) = (ac — bd) + (ad + be)i. 


Thus, an equation like 
I 2)-G+li=1+7 
may be regarded simply as an abbreviation for the “eo equations 


a — ae 
ae 


DEFINITION 
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The solution of the quadratic equation ax? + bx + ¢ = 0 with real coefficients 
could be paraphrased as follows: 


If u* — v2 = b? — 4ac, 
nu=O, 
Ca CFs vi)? = b* — Aac), 


9) 
—b+u\~ v y —b+u 
a| {| ———]}] -(— + b | ——— i. 
( 2a ) fe 2a ae 
—| 
s[o(=2##) (2) 
2a 2a 


; —b+ut+oi : —hb+u+ul 
Le., then a | —————. ] +. b | ————— _ ] + c = 0 


2a 24 


then 


It is not very hard to check this assertion about real numbers without writing 


ae 


down a single “7,” but the complications of the statement itself should convince 
you that equations about complex numbers are worthwhile as abbreviations for 
pairs of equations about real numbers. (If you are still not convinced, try para- 
phrasing the solution of the cubic equation.) If we really intend to use complex 
numbers consistently, however, it is going to be necessary to present some reason- 
able definition. 

One possibility has been implicit in this whole discussion. All mathematical 
properties of a complex number a + bi are determimed completely by the real 
numbers a and b; any mathematical object with this same property may reasonably 
be used to define a complex number. ‘The obvious candidate is the ordered pair 
(a, b) of real numbers; we shall accordingly define a complex number to be a pair 
of real numbers, and likewise define what addition and multiplication of complex 
numbers is to mean. 


A complex number 1s an ordered pair of real numbers; if z = (a, b) is a com- 


plex number, then a is called the real part of <, and 4 1s called the imaginary 


part of <. The set of all complex numbers is denoted by C. If (a, b) and (c,d) 
are two complex numbers we define 


(a.b)+ (c,d) =(a+c,b+d) 
(a, b) “esa (@°c—b-d,a-a+ be). 


(The + and - appearing on the left side are new symbols being defined, while the 
+ and - appearing on the right side are the familiar addition and multiplication 
for real numbers.) 


When complex numbers were first introduced, it was understood that real num- 
bers were, in parucular, complex numbers; if our definition is taken seriously this 
is not true a real number is not a pair of real numbers, after all. Thuis difficulty 
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DEFINITION 


is only a mmor annoyance, however. Notice that 


(a,0) + (b,0) = (a+ b,0+4+0) = (a +b, 0), 
(a,0)- (6,0) = eb —0-0, a- 0+ 0b) = (a -8, 0): 


this shows that the complex numbers of the form (a, 0) behave precisely the same 
with respect to addition and multiplication of complex numbers as real numbers 
do with their own addition and multiplication. For this reason we will adopt the 
convention that (a,0) will be denoted simply by a. The familiar a + bi notation 
for complex numbers can now be recovered if one more definition is made. 


Notice that i? = (0, 1) - (0, 1) = (-1.0) = —1 (the last equality sign depends 
on our convention). Moreover 


(a,b) = (a, O) + (0, db) 
= (a, QO) + (b, 0) - (0, 1) 
=at bi. 


You may feel that our definition was merely an elaborate device for defining 
complex numbers as “expressions of the form a+ bi.” That is essentially correct; 
it is a firmly established prejudice of modern mathematics that new objects must 
be defined as something specific, not as “expressions.” Nevertheless, 1t is inter- 
esting to note that mathematicians were sincerely worried about using complex 
numbers until the modern defimtion was proposed. Moreover, the precise defini- 
tion emphasizes one miportant point. Our aim in introducing complex numbers 
was to avoid the necessity of paraphrasing statements about complex numbers in 
terms of their real and imaginary parts. ‘This means that we wish to work with 
complex numbers in the same way that we worked with rational or real numbers. 
For example, the solution of the cubic equation required writing x = w — p/3w, 
so we want to know that 1/w makes sense. Moreover, w? was found by solving a 
quadratic equation, which requires numerous other algebraic manipulations. In 
short, we are likely to use, at some time or other, any manipulations performed on 
real numbers. We certainly do not want to stop each time and justify every step. 
Fortunately this is not necessary. Since all algebraic manipulations performed on 
real numbers can be justified by the properties listed in Chapter 1, it is only nec- 
essary to check that these properties are also true for complex numbers. In most 
cases this is quite easy, and these facts will not be listed as formal theorems. For 
example, the proof of P1, 


ene) ee i (ery) = Cab) herd) (asf 


requires only the application of the definition of addition for complex numbers. 
The left side becomes 


(ljate]+e,[b+d]+ f). 


25. Complex Numbers 533 


and the right side becomes 


(a"+ [c+ ele) Ha Ff); 


these two are equal because PI! is true for real numbers. It is a good idea to 
check P2-P6 and P8 and P9. Notice that the complex numbers playing the role 
of O and | in P2 and P6 are (0,0) and (1, 0), respectively. It is not hard to figure 
out what —(a, b) is, but the multiplicative inverse for (a, b) required in P7 is a little 


trickier? 1f (d-D) S20, 0)) nem a* + b2 =e 0 anne 


a =) 
(a,b): (os a i) = (1), 
This fact could have been guessed in two ways. To find (x, y) with 
(Ga De (Xa) ClO) 
it is only necessary to solve the equations 


ax —by = 1, 


bx +ay =0. 
The solutions are x = a/(a? +b7), y = —b/(a* +b’). It is also possible to reason 
that if 1/(a + bi) means anything, then it should be true that 
l ] a—bi a—bi 


fopa eb, ab ae be 


Once the existence of mverses has actually been proved (after guessing the inverse 
by some method), 1t follows that this manipulation ts really valid; it is the easiest one 
to remember when the inverse of a complex number is actually being sought—1t 
was preciscly this trick which we used to evaluate 


15 15 63) 
ji eee 
90 — 45i 
F bce 


Unlike P1-P9, the rules P1O-P12 do not have analogues: it 1s easy to prove that 
there is no set P of complex numbers such that P1O-P12 are satisfied for all complex 
numbers. In fact, if there were, then P would have to contain | (since 1 = 12) and 
also —1 (since —1 = i7), and this would contradict P10. The absence of P10 P12 
will not have disastrous consequences, but it does mean that we cannot define 
< < w for complex z and w. Also, you may remember that for the real numbers, 
P10 P12 were used to prove that 1 + 1 4 0. Fortunately, the corresponding fact 
for complex numbers can be reduced to this one: clearly (1,0) + (1,0) 4 (0, 0). 

Although we will usually write complex numbers im the form a + bi, it 1s worth 
remembering that the set of all complex numbers C ts just the collection of all 
pairs of real numbers. Long ago this collection was identified with the plane, and 
for this reason the plane is often called the “complex plane.” ‘The horizontal axis, 
which consists of all pomts (a, 0) for a in R, ts often called the real axis, and the 
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vertical axis 1s called the wmaginary axis. ‘Two important definitions are also related 
to this geometric picture. 


DEFINITION If z = x+y is a complex number (with x and y real), then the conjugate Z 
of z is defined as 


rd om) 


and the absolute value or modulus |z| of z is defined as 


lz] =vVx? 4+ y°. 


=. He ff RNY es Fy <0 ) ) 9 Dae = : . 
DA a ee (Notice that x“ + y~ > 0, so that V.x~ + y~ is defined unambiguously; it denotes 


length 


. . 9) 2 
the nonnegative real square root of x* + y~.) 


ez =x-—ly : : So. 2 5 a 4 4 é ; 
: Geometrically, Z 1s simply the reflection of z im the real axis, while |z| is the 


Te Ae distance from z to (0,0) (Figure 1). Notice that the absolute value notation for 
complex numbers is consistent with that for real numbers. ‘The distance between 
two complex numbers z and w can be defined quite easily as |z—w]. The following 


theorem lists all the important properties of conjugates and absolute values. 


THEOREM 1 Let z and w be complex numbers. ‘Then 


= 
je 


— 
ae 
NI NI 


= z if and only if z is real (ve., is of the form a + Oi, for some real 


number a). 


3) zfw=Z+w. 


— 
A 
—— 
| 
nN 
II 
| 
aN 


(Oo) co i 


(6) -'=(@-',i£ 240. 
(7) (2a se E. 

(3) [dena] Sol Zlecalitile 

(9) Iza w] SolZltetitoll 


PROOF Assertions (1) and (2) are obvious. Equations (3) and (5) may be checked by straight- 
forward calculations and (4) and (6) may then be proved by a trick: 


0=O0=274+ (-2) =z+-2. so-¢=-E, 
| lz |) = zs ze so z-! =(z)7!. 


Equations (7) and (8) may also be proved by a straightforward calculation. “Phe 
only difficult part of the theorem is (9). This inequality has, in fact, already oc- 
curred (Problem 4-9), but the proof will be repeated here, using slightly different 
terminology. 

It is clear that equality holds in (9) if z = 0 or w = 0. It is also easy to see that (9) 
is true if z = Aw for any real number A (consider separately the cases A > O and 
4 < 0). Suppose, on the other hand, that z 4 Aw for any real number A, and that 


ISG IUERSE ee 
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w #0. Then, for all real numbers i, 


(fe 0 = (2 — Aw ee ae) 
= (z —Aw)- (Z—Aw) 
= 22+)? ww —A(wi+ ww) 
= 12 \w|? + lel? —AQ@wz + <0). 


Notice that wZ + zw is real, since 
Ween mee se 2a WZ 1 ZW We 2 


Thus the right side of (s) is a quadratic equation in A with real coefficients and no 
real solutions; its discriminant must therefore be negative. Thus 


(wz + eit)? —4)wl? - |Z)? <0: 
it follows, since wz + Zw and |w|- |z| are real numbers, and |w| - |z| > O, that 
(wz+zw) < 2|w|- lz]. 
brom this equality it follows that 


Iz+ wl? SSG See eae im), 
= |z|* + [wl + (wz + zw) 
< [</> + wl? + 2hw] - [eI 
= (|z| + |wl)>, 


which implies that 
Jz + wl] < [zl + lol. I 


The operations of addition and multiplication of complex numbers both have 
important geometric interpretations. The picture for addition is very simple (Fig- 
ure 2). Two complex numbers z = (a,b) and w = (c,d) determine a paral- 
lelogram having for two of its sides the line segment from (0,0) to z, and the 
line segment from (0,0) to w; the vertex opposite (0,0) is z + w (a proof of this 
geometric fact 1s left to you [compare Appendix | to Chapter 4)). 

The interpretation of multiplication is more involved. If c = 0 or w = 0, 
then z+ w = O (a one-line computational proof can be given, but even this 1s 
unnecessary the assertion has already been shown to follow from PI P9), so we 
may restrict our attention to nonzero complex numbers. We begin by putting every 
nonzero complex number into a special form (compare Appendix 3 to Chapter 4). 
For any complex number z 4 0 we can write 


. 
[=| 


z| Is a positive real number, while 


oo Pa lz| 


in this expression, 


Iz| 


= 
~ 


Iz| 


~ 
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so that z/|z| 1s a complex number of absolute value 1. Now any complex number 
. . 9) 3) . . . 
a=x+iy with 1 = |a| = x*+ y* can be written m the form 


length a = (cos@,sm@) = cos@ +7 sin@ 


for some number @. ‘Thus every nonzero complex number z can be written 
angle of @ radians - = r(cos@ +isin 6) 


for some r > 0 and some number @. ‘The number r is unique (it equals |z|), but é 


Is not unique; if 9 1s one possibility, then the others are 69 + 2km for k in Z—any 
one of these numbers is called an argument of z. Figure 3 shows z in terms of r 
and 6. (To find an argument 6 for z = x +/y we may note that the equation 


ENG UARaR eS 


x+iy = z= |z|(cos9 +i sin 0) 
means that 


x =alelnees G2 
y =saisinige 
So, for example, if x > 0 we can take 9 = arctan y/x; if x = 0, we can take 
G.=7q/2 whengy > "Oana = 37/2 when y <0) 
Now the product of two nonzero complex numbcrs 


z=r(cos@+isin@), 
w= s(cos@ +ising), 


-w=rs(cos@ +7 sin@)(cos@ +i sing) 
= rs|(cos@ cos @ — sin @ sin @) + i(sin 6 cos@ + cos 6 sin )| 
= rs[cos(@ + $) +i sn(6 + @)]. 


N 


Thus, the absolute value of a product is the product of the absolute values of the 
factors, while the sum of any argument for each of the factors will be an argument 
for the product. For a nonzero complex number 


z=r(cos6 +isin@) 
it is now an easy matter to prove by induction the following very important formula 
(sometimes known as De Mowre’s Theorem): 
z" = [z|"(cosn@ +isinn@), for any argument 6 of <. 
n 


This formula describes 2” so explicitly that it is easy to decide just when z” = w: 


THEOREM 2 Every nonzero complex number has exactly n complex mth roots. 
More precisely, for any complex number w 4 0, and any natural number 1, 
there are precisely n different complex numbers z satisfying 2” = w. 


PROOF ben 
w = s(cosd@+isnd) 


FIGURE 4 
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for s = |w| and some number ¢. Then a complex number 
z=r(cos@+isin@) 
satisfies 2” = w if and only if 
r"(cosn@ +isinn@) = s(cos@ +i sing), 
which happens if and only if 


5 


cosn@ +isinnd =cosd@+isn®@g. 


From the first equation it follows that 


where /s denotes the positive real nth root of s. From the second equation it 
follows that for some integer k we have 


o 2k 
6=) = —+— 
n n 
Conversely, if we choose r = (/s and 6 = 6 for some k, then the number z = 


r(cos@ +7 sin@) will satisfy 2” = w. ‘To determine the number of nth roots of w, 
it is therefore only necessary to determine which such z are distinct. Now any 
integer k can be written 

k=ngtk 


for some integer g, and some integer k’ between 0 and n — 1. Then 


cos& + isin @& = cos Oe +7 sin &. 


n 


This shows that every z satisfying z” = w can be written 


Z= 0/5 (CosOp4-4sin Q,). k—0,..0, 71 aa 


Moreover, it is easy to see that these numbers are all different, since any two 6 for 
k=0,...,1 —1 difler by less than 27. J 


In the course of proving Theorem 2, we have actually developed a method for 
finding the mth roots of a complex number. For example, to find the cube roots 
of i (Figure 4) note that |i] = | and that 2/2 is an argument for 7. The cube roots 
of i are therefore 


| a _ 7] 
-[cos E + isin Z ; 


oe 6 


aor mae = Piles. 3s 52 
3 6 = Cos er 
alee a Ae Smee, 47 _ 3h. | 8x 
- | COS out ae +7 sil 6. = 60S oot SID an: 


=) 
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Since 
TG V3 ; l 
cos — = —, m———, 
2 2 
S51 V8 57 | 
cos — = —- — i 
6 2 se aa) 
3 0 UE I 
cos — = 0, sin = —], 


the cube roots of 7 are 


‘ke oie ee 
, 7 nn 


i 
2 


- 


In general, we cannot expect to obtain such simple results. For example, to find 


the cube roots of 2 + Ili, note that |2 + 11/]| = J22+112 = V125 and that 


arctan Ls is an arguinent for 2+ 117. One of the cube roots of 2+ 117 1s therefore 


P arctan sal ee arctan H, 
125 | cos | ———— _]} +i sm | ————— 
3 3 
arctan i -£ arctan a 
25 cos | ——— _ ] +7 sm | ————— 


D 3) 


Previously we noted that 2 + i 1s also a cube root of 2+ 117. Since [2 +i7| = 
V2? +12 = V5, and since arctan 5 is an argument of 2 +7, we can write this 
cube root as 

2+i= V/5(cos arctan 4 + 7 sin arctan 5). 


These two cube roots are actually the same number, because 


arctan x | 


3 = arctan 5 


(you can check this by using the formula in Problem 15-9), but this is hardly the 


sort of thing one might notice! 

The fact that every complex number has an ath root for all 1 1s just a special 
case of a very important theorem. The number i was originally mtroduced in 
order to provide a solution for the equation x? + | = 0. The Fundamental Theorem 
of Algebra states the remarkable fact that this one addition automatically provides 
solutions for all other polynomial equations: every equation 


z” a ei. = ap — 0 aj.n.. 4,1 ine 


has a complex root! 

In the next chapter we shall give an almost complcte proof of the Fundamental 
Theorem of Algebra; the slight gap left in the text can be filled m as an exercise 
(Problem 26-5). ‘The proof of the theorem will rely on several new concepts which 
come up quite naturally in a more thorough mvestigation of complex numbers. 
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PROBLEMS 


1 


*8, 


Find the absolute value and argument(s) of each of the following, 


(i) 3+4i. 
Gi) (3 +4i)74 
Gian 
(vw) V¥3+4i. 
( 


vy) (3+4i|. 
Solve the following equations. 


(i) x7+ix4+1=0: 

Gy x¢ aa HLS 0. 

(ii) x? +2ix—1=0. 
ap | eae, 
(2+i)x +iy=4° 
es ee so) 


Describe the set of all complex numbers z such that 


| 
wv) |z—al+ |z—bl=c. 
Zp steal part.of z. 


Prove that |z| = |Z|, and that the real part of z 1s (¢+z)/2, while the imaginary 
pantus (z =z) /2¢. 


Prove that |z + w|? + |z — w|? = 2(|z|? + |w|?), and interpret this statement 
geometrically. 


What is the pictorial relation between z and Vi -z/—i? (Note that there 
may be more than one answer, because Vi and ¥—i both have two different 
possible values.) Hint: Which line goes into the real axis under multiplication 


by ¥—i? 


(a) Prove that if ag, ... , @,—1 are real and a + bi (for a and b real) satisfies 
the equation z” + d,-1z"~ 1 +--+ +ag = 0, then a — bi also satisfies this 
equation. (Thus the nonreal roots of such an equation always occur i 
pairs, and the number of such roots is even.) 

(b) Conclude that z” +a,_)Z"~'+---+ap is divisible by Ee ae 
(whose coefficients are real). 


(a) Let c be an integer which is not the square of another integer. If a and b 


: ; Pa sa 
are integers we define the conjugate of a + bc, denoted by a + bv. 
as a — b/c. Show that the conjugate is well defined by showing that a 
number can be written a + bJc, for integers a and b, in only one way. 


540 Infinite Sequences and Infinite Series 


*10. 


“AT. 


LZ 


(b) 


(c) 


Show that for all a and B of the form a+b /c, we have @ =a; & =a if 
and only if @ is an integer; a+ B =@+ 8; —w = —a;a-B =a- B; and 
a! =(a)-' if a £0. 

Erovemthat if doh... , @,— are Wileses anchc.—. a + bJ/c satisfies the 


equation z” + d,_1z"-! +--+ +49 = 0, then Z = a — bV© also satisfies 
this equation. 


Find all the 4th roots of i; express the one having smallest argument in a 
form that does not involve any trigonometric functions. 


(a) 
(b) 


Prove that if w is an nth root of 1, then so is aX. 

: Acne ea 9 oz 
A number @ is called a primitive nth root of | if {1,@.@?,...,@"7"} 
is the set of all nth roots of 1. How many primitive nth roots of | are 


therefor #= 394,95, 9? 


= 
Let w be an ath root of 1, with w 4 1. Prove that as =) 
k=0 
Prove that if 21, ... , 2x lie on one side of some straight hne through 0, 


then 2] +--+: +<24% #0. Hint: This is obvious from the geometric inter- 
pretation of addition, but an analytic proof is also easy: the assertion 1s 
clear if the line is the real axis, and a trick will reduce the general case 
to this one. 

Show further that z,7!, ... , 2,7! all lie on one side of a straight line 
through 0, so that zea ere ai a 


Prove that if |zj| = |z2| = |z3/' amd zg; + zo + €3 =O, then z, £9, and z3 are 
the vertices of an equilateral triangle. Hint: It will help to assume that z 1s 


real, and this can be done with no loss of generality. Why? 


CHAPTER 


COMPLEX FUNCTIONS 


You will probably not be surprised to learn that a deeper investigation of complex 
numbers depends on the notion of functions. Until now a function was (intuitively) 
a rule which assigned real numbers to certain other real numbers. But there is no 
reason Why this concept should not be extended; we might just as well consider a 
rule which assigns complex numbers to certain other complex numbers. A rigorous 
definition presents no problems (we will not even accord it the full honors of a 
formal defimition): a function is a collection of pairs of complex numbers which 
does not contain two distinct pairs with the same first element. Since we consider 
real numbers to be certain complex numbers, the old definition is really a special 
case of the new one. Nevertheless, we will sometimes resort to special terminology 
in order to clarify the context in which a function is being considered. A function 
f is called real-valued if f(z) is a real number for all z in the domain of f, and 
complex-valued to emphasize that it is not necessarily real-valued. Similarly, 
we will usually state explicitly that a function f is defined on [a subset of] R in 
those cases where the domain of f is [a subset of] R; in other cases we sometimes 
mention that f is defined on [a subset of] G to emphasize that f(z) is defined for 
complex z as well as real z. 

Among the multitude of functions defined on C, certain ones are particularly 
important. Foremost among these are the functions of the form 


| 


{@)=az fanz = - + an 


where ao, ... , @, are complex numbers. These functions are called, as in the 
real case, polynomial functions; they include the function f(z) = z (the “identity 
function”) and functions of the form f(z) =a for some complex number a (“con- 
stant functions”). Another important generalization of a familiar function is the 
“absolute value function” f(z) = |z| for all z in C. 

‘Two functions of particular importance for complex numbers are Re (the “real 
part function’) and Im (the “imaginary part function”), defined by 


Re(eee iy) =x, 
for x and y real. 
Im(x +iy) = y, : 
The “conjugate function” is defined by 


of Zaz —SRe(z) = 7 lin). 


Familiar real-valued functions defined on R may be combined in many ways to 
produce new complex-valued functions defined on Can example is the function 


: AY 5 2 
f(x+iy) =e’ sm(x — y) +ix" cosy. 


54] 
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The formula for this particular function illustrates a decomposition which is always 
possible. Any complex-valued function f can be written in the form 


f =utiv 


for some real-valued functions u and v—simply define u(z) as the real part of f(z), 
and v(z) as the imaginary part. ‘This decomposition is often very useful, but not 
always; for example, it would be inconvenient to describe a polynomial function 
in this way. 

One other function will play an important role in this chapter. Recall that an 
argument of a nonzero complex number z is a (real) number @ such that 


z = |z|(cos@ +7 sin @). 


There are infinitely many arguments for z, but just one which satisfies 0 < 6 < 
2m. If we call this unique argument 6(z), then @ is a (real-valued) function (the 
“argument function’) on {z in C': g # 0}. 

“Graphs” of complex-valued functions defined on C, since they he in 4-dimen- 
sional space, are presumably not very useful for visualization. ‘The alternative 
picture of a function mentioned in Chapter 4 can be used instead: we draw two 
copies of C, and arrows from z in one copy, to f(z) in the other (Figure 1). 


FeG Wise 


The most common pictorial representation of a complex-valued function is pro- 
duced by labeling a point in the plane with the value f(z), instead of with z (which 
can be estimated from the position of the point in the picture). Figure 2 shows this 
sort of picture for several different functions. Certain features of the function are 
illustrated very clearly by such a “graph.” For example, the absolute value function 
is constant on concentric circles around O, the functions Re and Im are constant 
on the vertical and horizontal lines, respectively, and the function f(z) = z? wraps 
the circle of radius r twice around the circle of radius r?. 

Despite the problems involved in visualizing complex-valued functions m gen- 
eral, it is stll possible to define analogues of important properties previously defined 
for real-valued functions on R, and im some cases these properties may be easier 
to visualize in the complex case. Mor example, the notion of limit can be defined 
as follows: 


lim f(z) =/ means that for every (real) number ¢ > 0 there ts a (real) number 


poate} 


5 > 0 such that, for all«,if 0 < |z—<a| < 46, then | f(z) —7| < «. 
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Although the definition reads precisely as before, the interpretation is slightly dif- 

ferent. Since |z — w| is the distance between the complex numbers z and w, the 

equation lim f(z) = / means that the values of f(z) can be made to lie inside 
rire) 


any given circle around /, provided that z is restricted to lie inside a sufficiently 
small circle around a. ‘This assertion is particularly easy to visualize using the “two 
copy” picture of a function (Figure 3). 


nn’ é 


WRG WIRES 


Certain facts about limits can be proved exactly as in the real case. In particular, 


lime ——c, 
La 
lim"= a, 
ae — {8 


lim [ f(z) + g(z)] = lim f(z) + lim g(z), 
lim J@)-2a= lim fa lim g(Z), 


| ] 
lim = ———4 fiftlim ee 0. 
za g(z) hin g(z) za 8(2) # 


The essential property of absolute values upon which these results are based is the 
inequality |z + w| < |z| + |w|, and this inequality holds for complex numbers as 
well as for real numbers. ‘These facts already provide quite a few limits, but many 
more can be obtained from the following theorem. 


THEOREM 1 Let f(c) = u(z) + fv(z) for real-valued functions uv and v, and let / = a + if for 
real numbers @ and B. ‘Then lim f(z) = / if and only if 
fe aat 


lim «(zy =o, 


Fama el 


limiv@) =, 


réales 


PROOF Suppose first that lim f(z) =/. If e > 0, there is 6 > 0 such that, for all z, 
aa Gs 
0 < |z—a| <6, then | f(z) —1| < «. 
The second inequality can be written 


[u(z) — a) + i[v(z) — p]| =e. 
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or 
9 9 9 
[u(z) —a]~ + [v(z) — Bl" <.e°. 
Since u(z) — @ and v(z) — B are both real numbers, their squares are positive; this 
inequality therefore implies that 
9 9 9 9 
[u(z) -—@]” <e- and [v(z)—B]° <e*, 
which implies that 
ju(z) —a|<e and _ |v(z)— pl <e. 
Since this is true for all ¢ > 0, it follows that 


limu(z)=q@ and lim v(z) = B. 


za 


i—a 


Now suppose that these two equations hold. If ¢ > 0, there is a 6 > 0 such that, 
for all z, if O < |z —a| < 6, then 


lu(z) —a@| < and |v(z) —a@| < 


A 


bo| 


which implies that 


lf (z) —1| = |[u(z) — a] +i[v(z) — B]| 
< |u(z) —a@| + || - |u(z) — Bl 
é& € 
== of 5 =e 


This proves that lim f(z) =. J 


In order to apply Theorem | fruitfully, notice that since we already know the 
limit lim z = a, we can conclude that 


Za 


lim Re(z) = Re(a), 


2->a 


lim Im(z) = Im(a). 


i? da 


A limit like 
lim sin(Re(z)) = sin(Re(a)) 


za 
follows easily, using continuity of sin. Many applications of these principles prove 
such limits as the following: 


lim |z| = |al, 
<a 


° a: ae “3 
lim  esinx + ix? cosy =e’ sina +ia° cosb. 
(x+ty)—a+bi 
Now that the notion of limit has been extended to complex functions, the notion 
of continuity can also be extended: f is continuous ata if lim f(z) = f(a), and 


a” 
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f is continuous if f is continuous at a for all a in the domain of f. The previous 
work on limits shows that all the following functions are continuous: 


fO=ac aoe 


face 
fO Shae 


. : ar a3 
Ge + iy =e si tox Gory: 


ae oo Se May. 


Examples of discontinuous functions are easy to produce, and certain ones come 
up very naturally. One particularly frustrating example is the “argument func- 
tion” #, which is discontmnuous at all nonnegative real numbers (see the “graph” 
in Figure 2). By suitably redefining 6 it is possible to change the discontinu- 
ities; for example (Figure 4), if 6’(z) denotes the unique argument of z with 
m/2 < 6'(z) < 5/2, then 6’ is discontinuous at ai for every nonnegative real 
number a. But, no matter how 6 is redefined, some discontinuities will always 
occur. 


WAM AMI ANM/ AM AM ANM| ANIA 


TT “laas0 aCe ileeesT IC Dye Dye Dee Dae Dae Dae Dae Dae ae 
@3n 


WwW 
a) 


n|¥ 


—e—_e—_e—_e 
Ww 
q 


FIGURE 4 J (i= 92) 


The discontinuity of @ has an important bearing on the problem of dehning a 
“square-root function,” that is, a function f such that (f(z)? = £ forall z. lor real 
numbers the function “had as domain only the nonnegative real numbers. If 
complex numbers are allowed, then every number has two square roots (except 0, 
which has only one). Although this situation may seem better, it is in some ways 
worse; since the square roots of z are complex numbers, there 1s no clear criterion 
for selecting one root to be f(z), m preference to the other. 


FIGURE 5 


la, b] x [c,d] 


a 


b 
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One way to define f is the following. We set f (0) = 0, and for z 4 0 we set 


fZ= Vizl (cos a + isin “) 


= =~ 


Clearly (f (z))* = <, but the function f is discontinuous, since @ is discontinuous. 
As a matter of fact, it is impossible to find a continuous f such that (f(z))* = < for 
all z. In fact, it is even unpossible for f(<) to be defined for all z with |z| = 1. To 
prove this by contradiction, we can assume that f(1) = | (since we could always 
replace f by —f). Then we claim that for all 9 with 0 < 6 < 27 we have 


mt Dd 


0 
(*) f(cos6 +7 sin@) = cos 5 -+ isin 


The argument for this is left to you (it is a standard type of least upper bound 
argument). But (*) implies that 


lim f(cos@ +isin@) = cosa +i sinz 
0@>2n 
— =| 


# f(), 


even though cos@ +isin@ — | as 6 — 2x. Thus, we have our contradic- 
tion. A similar argument shows that it is impossible to define continuous “wth-root 
functions” for any n > 2. 

For continuous complex functions there are important analogues of certain the- 
orems which describe the behavior of real-valued functions on closed intervals. A 
natural analogue of the interval |a@, b] is the set of all complex numbers z = x +/y 
with a <x < band c < y <d (Figure 5). This set is called a closed rectangle, 
and is denoted by [a, b] x [c. d]. 

If f is a continuous complex-valued function whose domam is [a,b] x [c,d], 
then it seems reasonable, and is indeed true, that f is bounded on [a, b] x [c, d]. 
That is, there is some real number M such that 


[f(z)| < M for all z in [a, b] x [c, d]. 


It does not make sense to say that f has a maximum and a minimum value on 
[a,b] x [c,d]. since there is no notion of order for complex numbers. If f is a 
real-valued function, however, then this assertion does make sense, and is true. In 
particular, if f is any complex-valued continuous function on [a,b] x [c,d]. then 
| f| is also continuous, so there is some zg in [a, b] x [e.d] such that 


Lf Ceol (e)| for alli< in fia, b] x [e. d]; 


a similar statement is true with the mequality reversed. It 1s sometimes satd that 


“f attains its maximum and mimuum modulus on [a,b] x [c. d] 
The various facts listed in the previous paragraph will not be proved here, al- 
though proofs are outImed in Problem 5. Assuming these facts, however, we can 
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THEOREM 2 (THE FUNDAMENTAL 
THEOREM OF ALGEBRA) 


PROOF 


now give a proof of the Fundamental ‘Theorem of Algebra, which is really quite 
surprising, since we have not yet said much to distinguish polynomial functions 


from other continuous functions. 


Let ao,...,@,—; be any complex numbers. Then there is a complex number z 
such that 
= a) 
2 ie. l nee! a ap = 0). 


Let 
f{ Ge eae eae 


Then f is continuous, and so is the function | f| defined by 
| fl) = | f (z)| = Ee Beals ap eee + ao|. 


Our proof is based on the observation that a point zo with f (zo) = 0 would clearly 
be a miminum point for | f|. ‘Lo prove the theorem we will first show that | f| does 
indeed have a smallest value on the whole complex plane. ‘Vhe proof will be almost 
identical to the proof, in Chapter 7, that a polynomial function of even degree 
(with real coefficients) has a smallest value on all of R; both proofs depend on the 
fact that if [z| is large, then | f(z)| ts large. 

We begin by writing, for z 4 0, 


pe | ag 
(1+ te 2), 


lf (z)| = IzI" - | ae 


F(z) 


so that 


et 
M = maxi W214 |, |p. a Zeal aa) 


Then for all z with |z| > M, we have |z*| > |z| and 


reara| = laa = lan—z| = 
[ze — tel ~~ 2ulagch 2m 
sO 
ay] ag By] do I 
mmc er | Ld 
” all rae = ah aa 2D 
which implies that 
An—| do re ao I 
4 7 sige et eel eal = le ioe = 5 


This means that ‘ 


LC COT a Ye 


a 


> /2\/ (0)| mthen 


heal (0). 


~ 


In particular, if |z| > M and also 
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Now let [a,b] x [c,d] be a closed rectangle (Figure 6) which contains {z : |z| < 


max(M, \/2|f(0)|)}, and suppose that the minimum of |f| on [a,b] x [c, d] is 
attained at zg, so that 


(anol = hitZie for z im a,b] x fic, dl. 
It follows, im particular, that | f(zo)| < | f(0)|. Thus 
(2) if |z| > max(M, ¥2|f(0)|), then | f(<)| = |f()| = If Gol. 


Combining (1) and (2) we see that | f(zo)| < | f(z)| for all z, so that | f| attains its 
minimum value on the whole complex plane at zo. 


max(M, v 2|f(0)| ) 


LF) FO) 
for z here 
FIGURE 6 
‘To complete the proof of the theorem we now show that f(zo) = 0. It ts 


convenient to introduce the function g defined by 


@(z). =. f(z + Zo): 


Then g is a polynomial function of degree n, whose mmimum absolute value 
occurs at 0. We want to show that g(0) = 0. 


Suppose instead that g(0) = a 4 0. If m is the smallest positive power of z 
which occurs in the expression for g, we can write 


I 
e(z) Sa Bz" fog Hees Fez", 


where B 4 0. Now, according to Theorem 25-2 there is a complex number y 
such that 
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Thenysetting' dic; y*, we have 


le(vz)| Slot By 2” + dingiz! e+ + d,2" 
— la = az!" + eee AF Saha. Wa 


e ( _ re | +) | 


> 


II 
g 
= 
le 
+ 
e ~ 
cen a! 
S 
if 
(Gi 
(LE ——_]] 


This expression, so tortuously arrived at, will enable us to reach a quick contra- 
diction. Notice first that if |z| is chosen small enough, we will have 


Opes _ 
(64 


tll. 


If we choose, from among all z for which this inequality holds, some z which is real 
and positive, then 


Ts Gig ot » \2”"| =, il 
~ ~ ~ men 
a 
Consequently, if 0 < z < 1 we have 
d, d. 
pee [let Jf eel [tes] 
a 
a | Sette do ae | | 
a 


< ] = ae alts zo 


This is the desired contradiction: for such a number z we have 


le(vz)| < lo 


5 


contradicting the fact that |@| is the minimum of |g| on the whole plane. Hence, 
the original assumption must be incorrect, and g(0) = 0. This implies, finally, that 


Acca 


Even taking into account our omission of the proofs for the basic facts about 
continuous complex functions, this proof verified a deep fact with surprisingly 
little work. It is only natural to hope that other interesting developments will arise 
if we pursue further the analogues of properties of real functions. ‘The next obvious 
step is to define derivatives: a function f is differentiable at a if 


. fla+z)—- fla) 
be aa 


exists, 
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in which case the limit is denoted by f’(a). It is easy to prove that 


i= 0 ty ae 
fla=1 ta =F. 
(f +) (a) = f'(a) + 8'(a), 
(f - 8) (a) = f'(a)g(a) + flayg'(a), 


i \” —g'(a) ; 
=a) ———_ f 0, 
8 ) [g(a)]? ae 


(f og)(a) = f'(g(a)) - g'(a): 


the proofs of all these formulas are exactly the same as before. It follows, im 
particular, that if f(z) = 2", then f’(z) = nz"-!. These formulas only prove the 
differentiability of rational functions however. Many other obvious candidates are 
not differentiable. Suppose, for example, that 


Gitte 1 )) = Nerd Yo {1.€:, of (2) = ©). 


If f is to be differentiable at 0, then the limit 


peasy) = FQ) x—-iy 
im = ——————— = _ lim ; 
(xt+iy) 0 eat ky «+iy)2>0 ay 
must exist. Notice however, that 
i" 
ii —UPthen —4— 
AX Pry 
and 
* x-iy 
it 4 — 0. then — = —]; 
arr oe 


therefore this limit cannot possibly exist, since the quotient has both the values | 
and —1 for x + iy arbitrarily close to 0. 

In view of this example, it is not at all clear where other differentiable functions 
are to come from. If you recall the definitions of sin and exp, you will sce that 
there is no hope at all of generalizing these definitions to complex numbers. .\t 
the moment the outlook is bleak, but all our problems will soon be solved. 


PROBLEMS 


1. (a) For any real number y, define a(x) = x +/7y (so that @ Is a complex- 
valued function defined on R). Show that @ is continuous. (This follows 
immediately from a theorem in this chapter.) Show similarly that B(y) = 
x +7y is continuous. 

(b) Let f be a continuous function defined on C. For fixed y, let giv) = 
f(x +iy). Show that g is a continuous function (defined on Rj. Show 
similarly that h(y) = f(x +/y) 1s continuous. Hint: Use part (a). 


2. (a) Suppose that f is a continuous real-valued function defined on a closed 
rectangle [a,b] x[c,d]. Prove that if f takes on the values f(<) aud f(w) 
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(b) 


*(b) 


for z and w in [a,b] x [c,d], then f also takes all values between f(z) 
and f(w). Hint: Consider g(t) = f(tz + (1 — 1)w) for t in [0, 1]. 

If f is a continuous complex-valued function defined on [a,b] x [c, d], 
the assertion in part (a) no longer makes any sense, since we cannot talk 
of complex numbers between f(z) and f(w), We might conjecture that 
f takes on all values on the line segment between f(z) and f(w), but 
even this is false. Find an example which shows this. 


Prove that if ag, ..., @,-) are any complex numbers, then there are 
complex numbers Zz], ... , Zn (not necessarily distinct) such that 


n 
2" + dy-pz" | +++ Fag = | [« = ay 
iI 
Prove that if ao, ... , @,-1 are real, then z” +. a,_;z"~! +.--- + a9 can be 
written as a product of linear factors z+a and quadratic factors z*-+az+b 
all of whose coefficients are real. (Use Problem 25-7.) 


In this problem we will consider only polynomials with real coefficients. Such 
. . mie . ta | 

a polynomial is called a sum of squares if it can be written as y7+---+h,7 

for polynomials 4; with real coefficients. 


(a) 
(b) 
(c) 


(a) 


Prove that if f is a sum of squares, then f(x) > 0 for all x. 
Prove that if f and g are sums of squares, then so is f : g. 


Suppose that f(x) > 0 for all x. Show that f is a sum of squares. Hint: 
k 


First write f(x) = [ [@ — aj)" g(x), where g(x) > 0 for all x. Then use 
f=: 

Problem 3(b). 

Let A be a set of complex numbers. A number z is called, as in the 
real case, a limit point of the set A if for every (real) ¢ > 0, there 1s 
a point a in A with |z —a| < € but < 4a. Prove the two-dimensional 
version of the Bolzano-Weierstrass ‘Theorem: If A is an infinite subset 
of [a,b] x [c,d], then A has a limit point in [a,b] x [c,d]. Hint: First 
divide [a, b] x [c,d] in half by a vertical line as in Figure 7(a). Since A 
is infinite, at least one half contains mfinitely many points of A. Divide 
this m half by a horizontal line, as in Figure 7(b). Gontinue in this way, 
alternately dividing by vertical and horizontal lines. 


(The two-dimensional bisection argument outlined in this hint is so stan- 
dard that the title “Bolzano-Weierstrass” often serves to describe the 
method of proof, in addition to the theorem itself. See, for example, 
H. Petard, “A Contribution to the Mathematical ‘Theory of Big Game 
Hunting,” Amer Alath. Monthly, 45 (1938), 446 447.) 

Prove that a continuous (complex-valued) function on [a,b] x |e, d] 1s 
bounded on [a,b] x [c,d]. (Imitate Problem 22-31.) 

Prove that if f is a real-valued continuous function on [a,b] x [c¢, d], 
then f takes on a maximum and minimum value on [a,b] x [c,d]. (You 
can use the sanie trick that works for ‘Theorem 7-3.) 


(a) a convex subset of the plane 


(b) a nonconvex subset of the plane 
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*6. ‘The proof of Theorem 2 cannot be considered to be completely elementary 
because the possibility of choosing y with y’” = —a/f depends on Theo- 


rem 25-2, and thus on the trigonometric functions. It is therefore of some 


interest to provide an clementary proof that there is a solution for the equa- 
tion, =e =) 


(a) 
(b) 


(c) 


Make an explicit computation to show that solutions of <7 — ¢ = 0 can 
be found for any complex number c. 

Explain why the solution of z<” —c = 0 can be reduced to the case where 
nis odd. 


Let zo be the point where the function f(z) = 2” 


—c has its minimum 
absolute value. If zo 4 0, show that the integer m in the proof of Theo- 
rem 2 is equal to 1; since we can certainly find y with y! = —e/B, the 
remainder of the proof works for f. It therefore suffices to show that the 
minimum absolute value of f does not occur at 0. 

Suppose instead that f has its minimum absolute value at 0. Since 77 is 
odd, the points +6, +67 go under f into —c+6", —c+6"1. Show that for 
small 6 at least one of these points has smaller absolute value than —c, 
thereby obtaming a contradiction. 


vie) =) Ys. .- (7 — 2)" for mi... ene = 0. 
k 
Show that f’(z) = (z—z)"!-...-(2 — 2%) > male _ wa) eo 
a=] 
k 
ete) = Yo male — zy)” !. Show that if oes OS them) ees 
wl 


cannot all lic on the same side of a straight line through z. Hint: Use 
Problem 25-11. 

A subset K of the plane is convex if K contains the line segment joming 
any two points in it (Figure 8). For any set A, there is a smallest convex 
set containing it, which is called the convex hull of A (Figure 9); ifa 
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aes 


10. 


point P is not in the convex hull of A, then all of A is contained on one 
side of some straight line through P. Using this information, prove that 
the roots of f’(z) = 0 le within the convex hull of the set {z),..., zx}. 
Further information on convex sets will be found in reference [18] of the 
Suggested Reading. 


Prove that if f is differentiable at z, then f is continuous at z. 


Suppose that f = u +iv where uw and v are real-valued functions. 


(a) 


(b) 


For fixed yo let g(x) = u(x + fyo) and h(x) = v(x + iyo). Show that if 
f' (xo + iyo) = a + if for real a and B, then g’(xo) = @ and h'(xo) = B. 
On the other hand, suppose that k(y) = u(xg+ty) and /(y) = v(xo+iy). 
Show that /'(yo) = @ and k’(yo) = —f. 

Suppose that f’(z) = 0 for all z. Show that f is a constant function. 


Using the expression 


F(x) | | ( l l ) 
x)= eae ae x , 
L+x° 2Bi\x-—i x+i 
find f(x) for all k. 

Use this result to find arctan? (OQ) for all k. 
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COMPLEX POWER SERIES 


If you have not already guessed where differentiable complex functions are gomg 
to come from, the title of this chapter should give the secret away: we intend to 
define functions by means of infinite series. ‘This will necessitate a discussion of 
infinite sequences of complex numbers, and sums of such sequences, but (as was 
the case with limits and continuity) the basic definitions are almost exactly the 
same as for real sequences and series. 

An infinite sequence of complex numbers is, formally, a complex-valued func- 
tion whose domain is N; the convenient subscript notation for sequences of real 
numbers will also be used for sequences of complex numbers. A sequence {a,} of 
complex numbers is most conveniently pictured by labeling the points a, in the 
plane (Figure 1). 

The sequence shown in Figure | converges to 0, “convergence” of complex 
sequences being defined precisely as for real sequences: the sequence {dy} 
converges to /, in symbols 


hinea, I 
I= 19,0) 


if for every € > O there is a natural number N such that, for all 7, 
it n> N. then ‘la, —/| < €. 


This condition means that any circle drawn around / will contain a, for all sufh- 
ciently large n (Figure 2); expressed more colloquially, the sequence is eventually 
inside any circle drawn around 1. 

Convergence of complex sequences is not only defined precisely as for real 
sequences, but can even be reduced to this familiar case. 


et 
a, — 2, 1c, for realep, andic. 
and let 
l=Bt+iy for real B and y. 


Then lim a, = / if and only if 
n—- Co 


lin by =p and inn erene— bt 

noo n—Co 
The proof is left as an easy exercise. If there 1s any doubt as to how to proceed, 
consult the similar Theorem | of Chapter 26. § 


The sum of a sequence {a,} is defined, once again, as lim s,, where 
RCO 


Sp HO ap °°° SP Gre 


mn 
in 
nm 
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THEOREM 3 


PROOF 


Sequences for which this limit exists are summable; alternatively, we may say that 


the mfinite series ) a, converges if this limit exists, and diverges otherwise. It 
n=] 

is unnecessary to develop any new tests for convergence of infinite series, because 

of the followmg theorem. 


Let 
CG, =O iG, for realy, andic,. 
fora) we oS 
Then or converges if and only if SS b, and 1S c, both converge, and in this 
n=] n=l =| 
case 
Seu +i( oa): 
n=] n=l n=] 


This is an inmediate consequence of Theorem | applied to the sequence of partial 
sums of {an}. ff 


There is also a notion of absolute convergence for complex series: the series 


) a, converges absolutely if the series ) |a,| converges (this 1s a series of real 


n=1 n=l 
numbers, and consequently one to which our earlier tests may be applied). The 


ollowing theorem is not quite so casy as the preceding two. 
foll g theorem is not quit y as tl ding t 


Let 
a 0) IC, for real b, and c,. 
9° oo OX 
‘Then Na, converges absolutely if and only if a: and Ne both converge 
n=l n=] n=] 
absolutely, 


Suppose first that Pees and ~~ c, both converge absolutely, 1.e., that S~ |b, | and 


n=] n=1 | 
ee lcn| both converge. It follows that 3 lbn| + |cn| converges. Now, 
n=1 n=| 
lan | = lb, Se LG), = lbn| alg (Cuse 
ie.4) 


It follows from the comparison test that ) la,| converges (the numbers |a,| and 


n=] 
OO 


|b» | + |e, | are real and nonnegative). ‘Thus ) a, converges absolutely. 


n=1 
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le @) 
Now suppose that ) lan| converges. Since 


n=] 


lay | = b,* eta Cn, 


it is Clear that 
lopli =a, sand le>| = laa 
CO OO 
Once again, the comparison test shows that |b,| and S lcn| converge. J 


n=1 n=1 


o@) 
‘Two consequences of ‘Theorem 3 are particularly noteworthy. If ) An COn- 


n=1 


le,2) & 
verges absolutely, then ) b, and ) Cn also converge absolutely; consequently 


n=l I 


lo 2) oO lo @) 
) b, and ) cn converge, by Theorem 23-5, so ) a, converges by Theorem 2. 
n=1 Hil n=] 
In other words, absolute convergence implies convergence. Similar reasoning 


shows that any rearrangement of an absolutely convergent series has the same 
sum. These facts can also be proved directly, without using the corresponding the- 
orems for real numbers, by first establishing an analogue of the Cauchy criterion 
(see Problem 13). 

With these preliminaries safely disposed of, we can now consider complex 
power series, that is, functions of the form 


f() = RAC = @)E= ap + a (ee) at aaa) te 
n=0 


Here the numbers a and a, are allowed to be complex, and we are naturally 
interested in the behavior of f for complex z. As in the real case, we shall usually 
consider power series centered at 0, 


ie = eens 


n=0 


in this case, if f (zo) converges, then f(z) will also converge for |z| < |zo|. The 
proof of this fact will be similar to the proof of Theorem 24-6, but, for reasons 
that will soon become clear, we will not use all the paraphernalia of uniform con- 
vergence and the Weierstrass M-test, even though they have complex analogues. 
Our next theorem consequently generalizes only a small part of Theorem 24-6. 


Suppose that 


CO 

; 
Gn20 = a9 + a 20 + a220° + °° 
n=O 
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FIGURE 3 


converges for some zo 4 0. ‘Then if |z| < |zo|, the two series 


lo) 

9} 
Da. = 80 12 4 oe a 
n=0 


a 
— 3) 
» Han” _— On ae 202Z =F 3a3z~ +... 


n=1 


both converge absolutely. 


As in the proof of Theorem 24-6, we will need only the fact that the set of numbers 
Qnzo0" 1s bounded: there is a number M such that 


la, Zoe Af for all n. 
We then have 
a n 
ey = = th = 
lanz" | = |anzo" | aa 
«0 
— i 
<a | 
rae) 
and, for z 40; 
] - nt 
n—ly __ = del < 
Nanz | = Salanzo-| + | — 
z| £0 
M a n 
<=" |— |. 
zl | Zo 
oO oO OO 
Since the series ) \z/zo|" and ) n |z/zo|" converge, this shows that both ) Gan 
n=0 n=) n=0 
OO OO 
and ) nanz"' converge absolutely (the argument for ) na,z" | assumed that 
n=l n=l 


z #0, but this series certainly converges for z = 0 also). 


Theorem 4 evidently restricts greatly the possibilities for the set 


OO 
|: > an2" converges} 
n=0 
OO 
For example, the shaded set A in Figure 3 cannot be the set of all z where Se 
n=0 


> 
~ |: 


converges, since it contains z, but not the number w satisfying |w| < 

It seems quite unlikely that the set of points where a power series converges 
could be anything except the set of poimts inside a circle. If we allow “circles of 
radius 0” (when the power series converges only at 0) and “circles of radius 00” 
(when the power series converges at all pots), then this assertion is true (with one 
complication which we will soon mention); the proof requires only ‘Theorem 4 and 
a knack for good organization. 
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For any power series 


[o@) 

9) 
arene = ag + ayz + a2z~ + 032° +:-. 
n=0 


one of the following three possibilities must be true: 


[e.e) 
(1) Le a,z" converges only for z = 0. 


n=0 
fore) 


(2) y anz" converges absolutely for all z in C. 


n=0 
fo) 


(3) There is a number R > O such that Sak” converges absolutely if |z| < R 


n=0 
and diverges if |z| > R. (Notice that we do not mention what happens 


when [z| = R.) 


Let 


CO 
— |: nR: ) d,w" converges for some w with |w| = x | 
n=0 
Suppose first that S is unbounded. Then for any complex number z, there ts 
CO 
a number x in S such that |z| < x. By definition of S, this means that ) “a 


n=0 
oe) 


converges for some w with |w| =x > |z|. It follows from Theorem 4 that > aya 
n=0 
converges absolutely. ‘Thus, m this case possibility (2) is true. 
Now suppose that S is bounded, and let R be the least upper bound of S. If 
oe) 


R = 0, then ars converges only for z = 0, so possibility (1) is true. Suppose, 


n=0 
on the other hand, that R > 0. Then if z is a complex number with |z| < R, there 
oO 


1s a number x in S with [z| < x. Once again, this means that ) d,w" converges 


n=0 
o.@) 
for some w with |z| < |w|, so that ) a,z" converges absolutely. Moreover, if 
n=0 


[e.@) 
lz| > R, then ) ayz" does not converge, since |z| is not in S. § 
n=0 


The number R which occurs in case (3) 1s called the radius of convergence of 
io, @) 
) a,z". In cases (1) and (2) it is customary to say that the radius of convergence 
n=0 
is" fmidloo, respectively “When OP = R < oo, the cwele {z > |z|'="R} is*calléd 
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1 
the terms a,,z”" are not bounded 


circle of 
convergence 


converges 
absolutely 


FIGURE 4 


Oo 


the circle of convergence of ) ayz'. If z is outside the circle, then, of course, 
ss p n=( 

) anz does not converge, but actually a much stronger statement can be made: 
n=0 


the terms a,z" 


are not even bounded. ‘lo prove this, let w be any number with 
lz| > |w| > R; ifthe terms a,z” were bounded, then the proof of Theorem 4 would 


[o@) 
show that ) d,w" converges, which is false. Thus (Figure 4), inside the circle of 
n=0 ? = 7 : ; 
convergence the series ) a,z converges in the best possible way (absolutely) and 
n=0 


oN 


outside the circle the series diverges in the worst possible way (the terms a,z 
not bounded). 
What happens on the circle of convergence is a much more difficult question. 


are 


We will not consider that question at all, except to mention that there are power 
series which converge everywhcre on the circle of convergence, power series which 
converge nowhere on the circle of convergence, and power series that do just about 
anything in between. (See Problem 5.) 


Algebraic manipulations on complex power series gi be justified just as in the 
OO 


neal cases huss ifs f(z). = > val and g(z) Sar both have radius of 
n=0 n=0 


(oe) 
convergence == K athen*i(z)" = Sian + b,)z" also has radius of convergence 
> Rand h = f +g inside the citalt of radius R. Similarly, the Cauchy product 


(oe) n 


A(z) = ) Guat, = Yo akbn—k, has radius of convergence > R and h = fg 
n=0 k=0 


love) 
inside the circle of radius R. And if f(z) = ) Qnz" has radius of convergence > 0 
n=0 


[o,@) 
and ag # QO, then we can find a power series ) b,z" with radius of convergence 


n=0 
> 0 which represents |/f inside its circle of convergence. 


But our real goal in this chapter is to produce differentiable functions. We 
therefore want to generalize the result proved for real power series in Chapter 24, 
that a function defined by a power series can be differentiated term-by-term in- 
side the circle of convergence. At this point we can no longer imitate the proof of 
Chapter 24, even if we were willing to introduce uniform convergence, because no 
analogue of Theorem 24-3 seems available. Instead we will use a direct argument 
(which could also have been used in Chapter 24). Before begmning the proof, 


we notice that at least there is no problem about the convergence of the series 
[o@) 


produced by term-by-term differentiation. If the series @,z" has radius of con- 
n=O 00 

vergence R, then ‘Theorem 4 mnmediately implies that the series 3 nanz” | also 
n=l 

. Moreover, if |z| > R, so that the terms a,z” are unbounded, 
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[e,@) 


1 


then the terms na,z”~* are surely unbounded, so ) NAnz | does not converge. 


ial 


oO 
This shows that the radius of convergence of i nana is also exactly R. 


a 


If the power series 
[o.@) 
(O= > Ge 
n=0 


has radius of convergence R > O, then f is differentiable at z for all z with |z| < R, 


and 
le @) 


VGQl= tee 


n=1 


We will use another “e/3 argument.” The fact that the theorem is clearly true for 
polynonual functions suggests writing 


zth)—-f) << = zth)y"-2) = 
n=l n=0 n=! 
oe) N 
(zee) 2") (ce h) = @ ) 
a > ae ae “eee ine See ee 
= ee ;; Gk , 
n=O n=0 
N N 
(ZF ih) ee) n—-] 
op = n& 
Fatwa 
n=0 n=] 
N lee) 


We will show that for any ¢ > 0, each absolute value on the right side can be made 
< €/3 by choosing N sufficiently large and h sufficiently small. This will clearly 
prove the theorem. 

Only the first term in the right side of (*) will present any difficulties. ‘To begin 
with, choose some zo with |z| < |zo| < R; henceforth we will consider only h 
with |z + h| < |zo|. The expression ((z +)” — z")/h can be written in a more 
convenient way if we remember that 

x" = bal n—1 n—2 n—-3,,2 n—| 
ea Ye 
a 
Applying this to 
(Gapdl)iae. muletey na 0B 
h — &+hy—z" 
we obtain 


et) ee : L ; 
AN a+" MEL ciel eee et eee ae 
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Since A 
(zthy + 2(z thy? te $2!| <alzol" |, 


we have 


< nlan| + |\zo|"—*. 


| ((z +h)" — z") 
h 


co 
But the series ) nlay| - lzol"—! converges, so if N is sufficiently large, then 


n=1 
ee) 


>> nlan| -lzol"" < 


n=N+1 


Go] 


This means that 


lo, 3) 


N 
((z +h)" —z") ((z +h)" — z") 
Lee Sf 


2 % 


pe + 4, 2 9 


n=N+1 n=N+1 


fe 6) 
=), Blan lzol"* < 


n=N+1 


((z +h)" — z") 
h 


é 
3 
In short, if N is sufficiently large, then 


fo 2) 


(e+h)* —2") eth = 2") 
(1) aa = 2 < 
n=0 n=0 


for all h with |z + h| < |zo|. 


Ww] 


It is easy to deal with the third term on the right side of (*): Since ye nanz" | 


converges, it follows that if N is sufficiently large, then ps 


00 N . 
(2) ) Was” = ) na,z"'| < 3: 
n=1 n=1 7 


Finally, choosing an N such that (1) and (2) are true, we note that 


/ 


N N 


} ((z +h)” — z") acy 
lim a,——————“—— = RG,Z > 
h>0 h 


n=0 n=1 
N 


since the polynomial function g(z) = y a,z" is certainly differentiable. ‘Therefore 
n=0 


N N 


Ay ((Z +h)" = z”) | é 
3) | SS nae"!| < 5. 


n=0 n=] 
for sufficiently small h. 


As we have already indicated, (1), (2), and (3) prove the theorem. M 
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Theorem 6 has an obvious corollary: a function represented by a power series 1s 
infinitely differentiable inside the circle of convergence, and the power series is its 
‘Taylor series at 0. It follows, in particular, that f is continuous inside the circle of 
convergence, since a function differentiable at z is continuous at z (Problem 26-8). 

‘The continuity ofa power series inside its circle of convergence helps explain the 
behavior of certain ‘Taylor series obtained for real functions, and gives the promised 
answers to the questions raised at the end of Chapter 24. We have already seen 
that the ‘Taylor series for the function f(z) = 1/(1 + oN, namely, 


9) 
lar zt— ote, 


converges for real z only when |z| < 1, and consequently has radius of conver- 
gence |. It is no accident that the circle of convergence contains the two points 
i and —i at which f is undefined. If this power series converged in a circle of 
radius greater than 1, then (Figure 5) it would represent a function which was 
continuous in that circle, in particular at i and —i. But this is impossible, since it 
equals 1/(1 + z”) inside the unit circle, and 1/(1 +z”) does not approach a limit 
as z Ss approaches 7 or —i from inside the unit circle. 

The use of complex numbers also sheds some light on the strange behavior of 
the ‘Taylor series for the function 


ee ea. 
OF seals 


Although we have not yet defined e* for complex z, it will presumably be true that 
if y is real and unequal to 0, then 


fey =| 


fliy) =e MOP = el? 


The interesting fact about this expression is that 1t becomes large as y becomes 
small. ‘Thus f will not even be continuous at 0 when defined for complex numbers, 
so it is hardly surprising that it is equal to its ‘Taylor series only for z = 0. 

The method by which we will actually define e* (as well as sin z and cos z) for 
complex z should by now be clear. For real x we know that 


moe 
mx=x-—+—-:::,~ 
al 31 SI 
9 
1x x4 
cosSx = Neal qe 
2 
Fie en 
Yee 
For complex z we therefore define 
ad ZS 
snz=2—-—+—-—::-, 
Sie! 
ae -4 
Cos ee ales tae: 
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Then sin'(z) = cosz, cos’(z) = —sinz, and exp’(z) = exp(z) by Theorem 6. 
Moreover, if we replace z by iz in the series for e*, and make a rearrangement 
of the terms (justified by absolute convergence), something particularly interesting 


happens: 
eal C2) Mn 
tz = AS 
e*=1+iz+ oT air 31 A 5) ap 
; o aeeie 
Se oe pe a 
2 eed Pm. 25 
= La oi ard Stan at ee , 
SO 


e’* = cosz + isin z. 


It is clear from the definitions (1.e., the power series) that 


sin(—z) = — sin z, 
cos(—z) = cos zZ, 


so we also have 


iz 


e ® =cosz —isinz. 


i 


From the equations for e’= and e~ we can derive the formulas 


: e-—e © 
sin Zz = —————_-,, 
D5 
ee tel 
cos Z = = 


The development of complex power series thus places the exponential function at 
the very core of the development of the elementary functions—it reveals a con- 
nection between the trigonometric and exponential functions which was never 
imagined when these functions were first defined, and which could never have 
been discovered without the use of complex numbers. As a by-product of this 
relationship, we obtain a hitherto unsuspected connection between the numbers e 
and z: if in the formula 


e’* =cosz+isinz 


we take z = 7, we obtam the remarkable result 


Dee He 
2ni/ 5 an mth root of By 


(More generally, e 
With these remarks we will bring to a close our investigation of complex func- 
tions. And yet there are still several basic facts about power series which have not 
been mentioned. ‘Uhus far, we have seldom considered power series centered at a, 
S 
f(2) =) an(z — a)", 


n=0 


FIGURE 6 
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except for a = 0. This omission was adopted partly to simplify the exposition. 
kor power series centered at a there are obvious versions of all the theorems in 


this chapter (the proofs require only trivial modifications): there is a number R 
foe) 


(possibly 0 or oo”) such that the series Y an(z — a)" converges absolutely for z 


n=0 
with |z — a| < R, and has unbounded terms for z with |z — a| > R; moreover, for 


all z with |z — a| < R the function 


le @) 
Ae = DS an @ aa a)" 
n=0 
has derivative 
OO 
lye n—-| 
f (= nan = Cae 
n=] 


It is less straightforward to investigate the possibility of representing a function 
as a power series centered at b, if it is already written as a power series centered 


ater. ote 


[e,@) 


F(z) — ) GAZ —a)" 
n=0 
has radius of convergence R, and b is a point with |b — a| < R (Figure 6), then it 
is true that f(z) can also be written as a power series centered at b, 


(Zz) = So balc a by" 


n=0 


(the numbers b, are necessarily f“(b)/n!); moreover, this series has radius of 
convergence at least R — |b — a| (et may be larger). 

We will not prove the facts mentioned in the previous paragraph, and there are 
several other important facts we shall not prove. For example, if 


ce) — ane —@)" and 1S Ca ay (z—b)", 
n=0 n=0 


and g(b) = a, then we would expect that f o g can be written as a power series 
centered at b. All such facts could be proved now without introducing any basic 
new ideas, but the proofs would not be as easy as the proofs about sums, products 
and reciprocals of power series. ‘The possibility of changing a power series centered 
at a into one centered at b is quite a bit more involved, and the treatment of 
f og requires sul} more skill. Rather than end this section with a four de force 
of computations, we will mstead give a preview of “complex analysis,” one of 
the most beautiful branches of mathematics, where all these facts are derived as 
straightforward consequences of some fundamental results. 

Power series were introduced in this chapter in order to provide complex func- 
tions which are diflerentiable. Since these functions are actually infinitely differ- 
entiable, it is natural to suppose that we have therefore selected only a very special 
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collection of differentiable complex functions. ‘The basic theorems of complex 
analysis show that this is not at all true: 


If a complex function is defined in some region A of the plane and 1s differentiable in A, 
° then it is automatically infinitely differentiable in A. Moreover, for each point ain A the 
‘Taylor series for f at a will converge to f in any circle contained in A (Figure 7). 


‘These facts are among the first to be proved in complex analysis. It is impossible 
to give any idea of the proofs themselves— the methods used are quite different 
from anything m elementary calculus. If these facts are granted, however, then 
the facts mentioned before can be proved very easily. 

Suppose, for example, that f and g are functions which can be written as power 
series. Then, as we have shown, f and g are differentiable—it then follows from 
easy general theorems that f + g, f- g, 1/g and f o g are also differentiable. 
alate a Appealing to the results from complex analysis, it follows that they can be written 

as power series. 

We already know how to compute the power series for f +g, f-g and 1/g from 
those for f and g. It is also easy to guess how one would compute an expression 
for fog asa power series In (z—b) when we are given the power series expansions 

‘e.@) 
f (2) = Sante an a)" 


n=0 
CO 
g(z) = > be(z — by". 
k=0 
with a = g(b) = bg, so that 


ee) 
g(z)-a=) b(z — by. 
Kl 


First of all, we know how to compute the power series 


(g(z) — a)! = (Some = my) 
k=] 


and this power series will begin with (z — by. Consequently, the coefficient of 2” 


l 


m P 
6) 


Bee l 
f(g@) =) a(g(2) —a) 
i=) 
can be calculated as a finite sum, involving only coefficients arising from the first 7 
powers of g(z) —a. 
Sunilarly, if 


OO 


ean S > anlz —a)" 


n=0 
has radius of convergence R, then f is diflerentiable in the region A = {z : |z—a| < 
R}. ‘Vhus, if b is in A, it ts possible to write f as a power series centered at b, 


FIGURE 8 
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which will converge in the circle of radius R — |b — aj. The coefficient of z” 
will be f(b)/n!. This series may actually converge in a larger circle, because 
[e,@) 


DO ale — a)" may be the series for a function differentiable mn a larger region 
n=0 
than A. For example, suppose that f(z) = 1/(1 + 27). Then f is differentiable, 


except at ¢ and —7, where it is not defined. Thus f(z) can be written as a power 

[o.@) 

series Gaz” 
n=0 

a2, =(—1)" and a, = 0 if k is odd). It is also possible to write 


with radius of convergence | (as a matter of fact, we know that 


Te) = So bal: = ie 


n=0 
where D, =! (m1) /n!. We can easily predict the radius of convergence of this 


series: itis V1 + (5), the distance from f to i or —i (Figure 8). 

As an added incentive to investigate complex analysis further, one more result 
will be mentioned, which lies quite near the surface, and which will be found in 
any treatment of the subject. 

For real < the values of sinz always lie between —1 and 1, but for complex z 
this is not at all true. In fact, if s =iy, for y real, then 

eifly) _ e-tiy) a) =o 
Siny oF = ao 
If y 1s large, then sin7y is also large in absolute value. ‘This behavior of sin is typical 
of functions which are defined and differentiable on the whole complex plane (such 
functions are called entire). A result which comes quite early in complex analysis is 
the following: 


Liowille’s Theorem: The only bounded entire functions are the constant functions. 


As a simple application of Liouville’s Theorem, consider a polynomial function 
oe nl 
vhalcai) — a + dy—12" =F BAe aalite 


where x > I, so that f is not a constant. We already know that f(z) is large for 
large z, so Liouville’s Theorem tells us nothing interesting about f. But consider 
the function | 


1) 

If f(z) were never 0, then g would be entire; since f(z) becomes large for large z, 
the function g would also be bounded, contradicting Liouville’s Theorem. Thus 
f(z) = 0 for some z, and we have proved the Fundamental Theorem of Algebra. 


g(z) = 


PROBLEMS 


1. Decide whether each of the followmg series converges, and whether it con- 
verges absolutely. 
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n=l 
OO * 
. 1+ 2i 
(11) 
‘ >: Qn 
n=1 
oo i” 
ML —, 
(iii) os 
n=l 
oo 
iv) SOG +50" 
n= 
| logn ,, logn 
(v) i 
=e n n 


2. Use the ratio test to show that the radius of convergence of each of the 
following power series is 1. (In each case the ratios of successive terms will 
approach a limit < | if |z| < 1, but for |z| > 1 the ratios will tend to oo or 
to a limit > 1.) 


n=1 
3. Use the root test (Problem 23-9) to find the radius of convergence of each of 


the following power series. (In some cases, you will need limits derived in the 
problems to Chapter 22.) 


n=] 
OO 
cate nz" 
(1) ) 
n” 
n=l 
ne 
\ n 
(IV) te 
aN es Qn 
n=1 
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‘The root test can always be used, in theory at least, to find the radius of 
convergence of a power series; in fact, a close analysis of the situation leads 
to a formula for the radius of convergence, known as the “Cauchy-Hadamard 
formula.” Suppose first that the set of numbers %/Ja,| is bounded. 


co 
(a) Use Problem 23-9 to show that if lim 7/la,||z| < 1, then ees con- 
n> CO 


n=0 
verges. 


oO 
(b) Also show that if lim ¥/Ja,||z| > 1, then ee. has unbounded terms. 
n> oO 


n=0 
Oo 
(c) Parts (a) and (b) show that the radius of convergence of > am is 
n=0 


1/ lim */Ja,| (where “1/0” means “oo”). To complete the formula, de- 
n> Oo 


fine lim /la,| = 00 if the set of all ¥/|a,| is unbounded. Prove that in 
n—©OoO 
‘o @) 
this case, ) a,z" diverges for z 4 0, so that the radius of convergence 
n=0 


is O (which may be considered as “1/00”’). 


Consider the following three series from Problem 2: 


ia Sa 2 =n 
: : oe 
i=l n? n=] ie n=] 
Prove that the first series converges everywhere on the unit circle; that the 
third series converges nowhere on the unit circle; and that the second series 
converges for at least one point on the unit circle and diverges for at least 
one point on the unit circle. 


(a) Prove that e® -e” = e*t” for all complex numbers z and w by showing 


that the infinite series for e**" is the Cauchy product of the series for e* 
aincUiag 5 

(b) Show that sin(z + w) = sinzcosw + coszsinw and cos(z + w) = 
cos zcos w — sin zsin w for all complex z and w. 


(a) Prove that every complex number of absolute value | can be written e!” 
for some real number y. 
(b) Prove that |e*t'’”| = e* for real x and y. 


(a) Prove that exp takes on every complex value except 0. 
(b) Prove that sin takes on every complex value. 


For each of the following functions, compute the first three nonzero terms of 
the ‘Taylor series centered at 0 by manipulating power series. 


(i ay (Ee) — tan, 
(i A= 2-2. 
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10. 


11. 


sinz __ ] 


Sin < 
1G —; 
sin(z~) 
i= 5 
E COS=Z 
- ] 
pans z4 — 272 +3 


(vin) f(z) = deeb — 1]. 


(a) 


Suppose that we write a differentiable complex function f as f = u + 
iv, where uv and v are real-valued. Let # and v denote the restrictions 
of w and v to the real numbers. In other words, #(x) = u(x) for real 
numbers x (but # is not defined for other x). Using Problem 26-9, show 
that for real x we have 


f(x) = u(x) + id'(x), 


where f’ denotes the complex derivative, while a’ and v’ denote the 
ordinary derivatives of these real-valued functions on R. 
Show, more generally, that 


f QS wae a), 
Suppose that f satisfies the equation 
(emmy Wsboreif 9 oorany =O, 


where the a; are real numbers, and where the f“ denote higher-order 
complex derivatives. Show that #@ and v satisfy the same equation, where 
a and o now denote higher-order derivatives of real-valued functions 
on R. 

Show that if a = b+ ci is a complex root of the equation z” eae = oe 
--»+ag = 0, then f(x) = e’* sincx and f(x) = e’* cosex are both 
solutions of (*). 


Show that exp ts not one-one on C. 

Given w # 0, show that e* = w if and only if z =x +iy with x = log |w| 
(here log denotes the real logarithm function), and y an argument of w. 
Show that there does not exist a continuous function log defined for 
nonzero complex numbers, such that exp(log(z)) = z for all z # 0. 
(Show that log cannot even be defined continuously for |z| = 1.) 


Since there is no way to define a continuous logarithm function we can- 
not speak of the logarithm of a complex number, but only of “a logarithm 
for w,” meaning one of the infinitely many numbers ¢ with e© = w. And 


12. 


13. 
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for complex numbers a and b we define a? to be a set of complex num- 
bers, namely the set of all numbers e?!°8¢ or, more precisely, the set of 
all numbers e?* where z is a logarithm for a. 


If m is an integer, then a” consists of only one number, the one given by 
the usual elementary definition of a’. 

™/” coincides with the set of values 
given by the usual elementary definition, namely the set of all b” where b 


is an mth root of a. 


If m and n are integers, then the set a 


If a and b are real and b is irrational, then a? 


members, even for a > 0. 

Find all logarithms of 7, and find all values of i‘. 

By (a’)* we mean the set of all numbers of the form z° for some number z 
in the set a’. Show that (1')! has infinitely many values, while 1!" has 


contains infinitely many 


only one. 


b-c 


Show that all values of a? are also values of (a”)°. Is a?© = (a’)*N(a°)?? 


For real x show that we can choose log(x +7) and log(x — /) to be 
a 
log(x +7) =log(V1+x7) +i i — arctan x) : 
log(x —i) = log(v 1+ a} —j S — arctan x) : 


(It will help to note that 2/2 — arctan x = arctan 1/x for x > 0.) 
‘The expression 


= l 1 
l+x2 Wo Vy ee 


yields, formally, 


| be 2etenal 


Use part (a) to check that this answer agrees with the usual one. 


| ds in eee — i) —log(x +4)]. 


A sequence {a,} of complex numbers is called a Cauchy sequence if 


hin? (Gn 4,| = OS Suppose that a, — Dro icm where bwandicy ave 
m,R CO 


real. Prove that {a,} is a Cauchy sequence if and only if {b,} and {cy} 
are Cauchy sequences. 

Prove that every Cauchy sequence of complex numbers converges. 

Give direct proofs, without using theorems about real series, that an 
absolutely convergent series is convergent and that any rearrangement 
has the same sum. (It is permitted, and in fact advisable, to use the proofs 
of the corresponding theorems for real series.) 


Prove that 


A Hl 
n [nx SIO || Se 
le SS) ae 
D AUS ma: = Ay gi(ntl)x/2 
— (é ( ae ae = pix mn, # € : 
= sill ~ 


z 
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15. 


16. 


n HT 


(b) Deduce the formulas for Y= cos kx and y- sin kx that are given in 


k=] k=1 
Problem 15-33. 


Letigg) bathe Fibonacgi sequence, a) = a7 =a, = ap + Gya1: 


(amie Gyi9/G),show that 7.7 = 1+ 1/7, 
(b) Show that if r = lim r, exists, then r = 1+1/r, so that r = (1+ V5) /2. 


n—>Oo 
(c) Prove that the limit does exist. Hint: If r, < (1 + V5 ) je, then ne — 
fie band re < 7,45. 


lo @) 
(d) Show that Diy aya" has radius of convergence 2/(1 + V5 ). (Using 


n=] 
Oo 


the unproved theorems in this chapter and the fact that ea = 
all 

—1/ (<* +z—1) from Problem 24-16 we could have predicted that the 

radius of convergence is the smallest absolute value of the roots of z? + 

z— 1 = 0; since the roots are (—1 4 V5) /2, the radius of convergence 

should be (—1 + V5) /2. Notice that this number is indeed equal to 


2/(1 + V5).) 


. 7 . . 9) 
Since (e? — 1)/z can be written as the power series 1 + 2/2! + z°/3!+--- 
which is nonzcro at O, it follows that there is a power series 


with nonzero radius of convergence. Using the unproved theorems in this 
chapter, we can even predict the radius of convergence; it is 277, since this is 


the smallest absolute value of the non-zero numbers z = 2kzi for which 
e’ —] = 90. The numbers b, appearing here are called the Bernoulli 
numbers.* 


(a) Clearly bp = 1. Now show that 


z 2 eee | 
"je ted een 
me | ge ae 
=> ae 
and deduce that 
yi = —t, by =O ifteisodd anda] 1. 


* Sometimes the numbers By = Cy i, are called the Bernoulli numbers, because by = 0 if n 
is odd and > 1 (see part (a)) and because the mumbers 63, alternate in sign, although we will not 
prove this. Other modifications of this nomenclature are also in use. 


Li. 
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(b) By finding the coefficient of 2” in the right side of the equation 


by > 2? 43 
2= (Yo: \(e+ 545 3 SB -), 
show that 
n : 
> (7) =\0' for a> ai. 
i=0 


This formula allows us to compute any by in terms of previous ones, and 
shows that each is rational. Calculate two or three of the following: 


aii ee el asp SU 
bo=5 bs=—-yH Y6= gp 53 =—3- 
*(c) Part (a) shows that 
ss b2,, -2n z e+) Zi el bie -te 
nr 2 @=1 2: et/2 ext? 


n=0 


Replace z by 2iz and show that 


ek 

2 an 2 

zcotz = y —(-1)"2 "2". 

(2n)! 
n=0 


*(d) Show that 


tan z = cot z — 2 cot 2z. 


*(e) Show that 


Dan y" | 5 
-_ ~152 2njp2n _ 4)52n—1 
tan z =y On ie (2 2s 


n=1 


(This series converges for |z| < 2/2.) 


The Bernoulli numbers play an important role in a theorem which is best 
introduced by some notational nonsense. Let us use D to denote the “differ- 


entiation operator,” so that Df denotes f’. Then D*f will mean f“) and 
x 

e? f will mean 2 f'/n! (of course this series makes no sense in general, 
n=0 

but it will make sense if f is a polynomial function, for example). Finally, 

let A denote the “difference operator” for which Af(x) = f(x + 1) — f(x). 

Now Taylor’s Theorem implies, disregarding questions of convergence, that 


fF") 


or 


2. @ 


’ PY (x) 
(x) f(x+1)— f(x)= a fee) 


n= 
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we may write this symbolically as Af = (e? — 1)f, where 1 stands for the 


“identity operator.” Even more symbolically this can be written A = e? — 1, 
which suggests that 


D 


D = ——A. 
eo 


Thus we obviously ought to have 


~ a by k 
D= ae A. 
k=0 


See er 
oH) = DSP +) - FO]. 


k=0 


The beautiful thing about all this nonsense is that it works! 


(a) Prove that (#*) is literally true if f is a polynomial function (in which case 
the infinite sum is really a finite sum). Hint: By applying (*) to f, find 
a formula for f(x +1)— f(x); then use the formula in Problem 16(b) 
to find the coefficient of f% (x) in the right side of (+). 
(b) Deduce from (+) that 
fO+-+f/M=)>) TC = (0) 
k=0 


(c) Show that for any polynomial function g we have 
n+l O° hy 
gO) $+ + a(n) = i g(tdt+ >> —fg® Pint 1) = g*— PO]. 
0 ma te 


(d) Apply this to g(x) = x? to show that 


n—l ? 
pti i 
p+ k\k-1 
Using the fact that b) = —!. show that 


p+l Pp 


i n nP Oya p 
Las ees es phat 
2 ee oP 


The first ten instances of this formula were written out in Problem 2-7, 
which offered as a challenge the discovery of the general pattern. ‘This 
may now seem to be a preposterous suggestion, but the Bernoulli num- 
bers were actually discovered in precisely this way! After writing out 
these 10 formulas, Bernoulli claims (in his posthumously printed work 
Ars Conjectandi, 1713): “Whoever will examine the series as to their regu- 
larity may be able to continue the table.” He then writes down the above 
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formula, offering no proof at all, merely noting that the coefficients b; 
(which he denoted simply by A, B, C,... ) satisfy the equation in Prob- 
lem 16(b). The relation between these numbers and the coefficients in 
the power series for z/(e* — 1) was discovered by Euler. 


*18. ‘The formula in Problem 17(c) can be generalized to the case where g is not a 
polynomial function; the infinite sum must be replaced by a finite sum plus a 
remainder term. In order to find an expression for the remainder, it is useful 
to introduce some new functions. 


(a) The Bernoulli polynomials g, are defined by 


n 


Qn (X) — 2 (7 Jonas 


k=0 
The first three are 
a l 
Q(x) =x > 
(x) =x? —x 4+ 
9(X) = xo — xX + =; 
« 6 
3x2 x 
93(x) =x cae, ee 


Show that 


@n(O) = by. 

Gn (1) = by, ifn 4 1. 
Gn (X) = NGp_\(X), 

n(x) = (—1)"gr.C1 — x). 


Hint: Prove the last equation by induction on n. 
(b) Let Ry*(x) be the remainder term in ‘Taylor’s Theorem for f“, on the 
interval [x,x + 1], so that 


N . 
f (k +n) (x ) 


(i) FO 1) — f"@)= —— + R"(2). 
n=1 i 
Prove that 
N b N h 
7 k ;. ~(k E k tipi 
f= a +1)—f%)] - DO a Rw-4' (x). 
k=0 k=0 


Hint: Imitate Problem 17(a). Notice the subscript N — k on R. 
(c) Use the integral form of the remainder to show that 


b Me el =F 
7 Ry-ak(x) = | ee 
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(d) Deduce the “Euler-Maclaurin Summation Formula”: 


oO Mee Siow 1) = + o(x +A) 
x-+n+t 1 N by 
= / e(t)dt+ aig tat+l)— Cae) + Sv(x,n), 
ag real ° 


where 


i 


fae iarll ; . ee 
cn) = — pa | NE 5 
i=0 sey) Ni 


(e) Let Ww, be the periodic function, with period 1, which satisfies y,(1) = 
g(t) for 0 <t < 1. (Part (a) implies that if n > 1, then wW, 1s continuous, 
since gy, (1) = gy, (0), and also that y, is even if n is even and odd if n is 
odd.) Show that 


xtnt+l ; 
= ii 
Satyr ec) = -| Eu eat 


xtn+l 
if ae é 
( = (-1)%! / EN Mat if x 1s.an integer), 


Unlike the remainder in ‘Taylor’s Theorem, the remainder Sy(x,7) usually 
does not satisfy sim Syv(x,n) = 0, because the Bernoulli numbers and functions 
become large very rapidly (although the first few examples do not suggest this). 
Nevertheless, unportant information can often be obtained from the summation 
formula. ‘The general situation is best discussed within the context ofa specialized 
study (“asymptotic series”), but the next problem shows one particularly important 
example. 


*€19, (a) Use the Euler-Maclaurin Formula, with N = 2, to show that 


log 1+---+log(m — 1) 


n 1 | 1 n w(t) 
= logs di = Flogn + 75 (7-1) 4 - ae dt. 


(b) Show that 


in ll" Walt) 
log (catye nro) = iD 7 | D2 dt. 


ee) 
+ . . . 2) . 
(c) Explain why the improper integral B = W(t) /2r- dt exists, and show 


that if a =exp(p £11/12),-then 


I 8 

n: a(t) 

log eat lous = Was at. 
ant /Ze-n+ if uli2r2)) _ ore 
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(d) Problem 19-41(d) shows that 


(h) 


ee i; (n!)222" 
i= hm ———. 
noo (2n)!./n 


Use part (c) to show that 


. a-n 
J/m = lim 


noo a(2n)2n+1/2e-2n /q ‘ 


5) ot) 
2n+1,—2n92n 
e2 


and conclude that a = JV 27. 


Show that 
1/2 ] 
/ y2(t) dt = / g2(t)dt = 0. 
0 0 


(You can do the computations explicitly, but the result also follows im- 
mediately from Problem 18(a).) Conclude that 


Six) [ wo ben forO <x < 1/2 
i= t)dt 
hoe 20 for lie rel, 
with w(n) = 0 for all n. Hint: Graph w on [0, 1], paying particular 


attention to its values at xp, 4, and x;, where x9 and x; are the roots 
of 2 (Figure 9). 


FIGURE 9 


Noting that w(x) = —W(1 — x), show that 
(x) = w(t) dt >0 on [0,1], 
0 


and hence everywhere, with y(n) = 0 for all n. 
Finally, use this information and integration by parts to show that 


™~ 
Wo (T) 
= it > O. 
/ pyr * 
1 


Using the fact that the maximum value of |g2(x)| for x in [0,1] is % 


conclude that 
~ W(t) l 


¢ —_ 


{(< : 
212 12n 


QO< 


vn 


Finally, conclude that 


19) ip ty > 
ld ntt'/26 ne - nt Aa nth en Wa n) 


578 Infinite Sequences and Infinite Serres 


The final result of Problem 19, a strong form of Strling’s Formula, shows that 
n! is approximately JV2x n"*!/2e-" in the sense that this expression differs from n! 
by an amount which is small compared to n when n is large. For example, for 
n = 10 we obtain 3598696 instead of 3628800, with an error < 1%. 

A more general form of Strling’s Formula illustrates the “asymptotic” nature of 
the summation formula. The same argument which was used in Problem 19 can 
now be used to show that for N > 2 we have 


= 


x 
ye by z Vn (t) Ar 


n! 
og | ——_——_ ] = 
5 (= nitl2e-n rae k(k — 1)n*-! . iNer 


Since Ww is bounded, we can obtain estimates of the form 


oO 
Yu) . RAN 
nn ~6NtN ~ A=) 


If N is large, the constant My will also be large; but for very large n the factor 


n'~% will make the product very small. Thus, the expression 


N 


b 
REV (2b | ne ae 
Vv 27 1n ¢ ep( ae) 


may be a very bad approximation for n! when n is small, but for large n (how large 
depends on N) it will be an extremely good one (how good depends on N). 


PART 5 


EPILOGUE 


There was a most ingenious Architect 
who had contrived a new Method 

for building Houses, 

by beginning at the Roof, and working 
downwards to the Foundation. 


JONATHAN SWIFT 


CHAPTER 


FIELDS 


‘Throughout this book a conscientious attempt has been made to define all im- 
portant concepts, even terms like “function,” for which an intuitive definition 1s 
often considered sufficient. But Q and R, the two main protagonists of this story, 
have only been named, never defined. What has never been defined can never be 
analyzed thoroughly, and “properties” P1 P13 must be considered assumptions, 
not theorems, about numbers. Nevertheless, the term “axiom” has been purposely 
avoided, and in this chapter the logical status of PI—P13 will be scrutinized more 
carefully. 

Like Q and R, the sets N and Z have also remained undefined. Truc, some 
talk about all four was inserted in Chapter 2, but those rough descriptions are far 
from a definition. ‘To say, for example, that N consists of 1, 2, 3, etc., merely 
hames some elements of N without identifying them (and the “etc.” is useless). 
The natural numbers can be defined, but the procedure is involved and not quite 
pertinent to the rest of the book. The Suggested Reading list contains references 
to this problem, as well as to the other steps that are required if one wishes to 
develop calculus from its basic logical starting point. ‘The further development 
of this program would proceed with the definition of Z, in terms of N, and the 
definition of Q in terms of Z. This program results in a certain well-defined 
set Q, certain explicitly defined operations + and -, and properties Pl-P12 as 
theorems. The final step in this program 1s the construction of R, in terms of Q. 
It is this last construction which concerns us. Assuming that Q has been defined, 
and that P1~ P12 have been proved for Q, we shall ultimately define R and prove all 
Oriel tor R. 

Our intention of proving P1~P13 means that we must define not only real num- 
bers, but also addition and multiplication of real numbers. Indeed, the real num- 
bers are of interest only as a set together with these operations: how the real 
numbers behave with respect to addition and multiplication is crucial; what the 
real numbers may actually be is quite irrelevant. ‘This assertion can be expressed in 
a meaningful mathematical way, by using the concept of a “field,” which includes 
as special cases the three important number systems of this book. This extraordi- 
narily important abstraction of modern mathematics incorporates the properties 
P1-P9 common to Q, R, and C. A field is a set F (of objects of any sort what- 
soever), together with two “binary operations” ++ and « defined on F (that is, two 
rules which associate to elements a and b in F, other elements a + b and a«b 
in F) for which the following conditions are satisfied: 

1) (atb)+c=at(b+c) forall a,b, and cin F. 
(2) There is some element 0 in F such that 
i)a+0O0=a_ forallam F, 
(il) for every a in F, there is some element b m F such that a + b = 0. 
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(3) a+tb=b+a forall aand bin F. 
(4) (a*b)*«c=a+(b-ec)  foralla,b, and c in F. 
(5) ‘There is some element 1 in F such that 1 4 0 and 


GQ) ael=a_forallain F, 
(ii) For every a in F with a ¥ 0, there is some element b in F such that 
aw = I. 
(66)a*b=bea_ forall a andbin F. 
(7) as(b+c) =aeb+aec forall a,b, and cin F. 


The familiar examples of fields are, as already indicated, Q, R, and C, with 
+ and + being the familiar operations of + and -. It is probably unnecessary to 
explain why these are fields, but the explanation is, at any rate, quite brief. When 
+ and + are understood to mean the ordinary + and -, the rules (1), (3), (4), (6), (7) 
are simply restatements of P1, P4, P5, P8, P9; the elements which play the role of 0 
and 1 are the numbers 0 and 1 (which accounts for the choice of the symbols 0, 1); 
and the number b in (2) or (5) is —a or a~!, respectively. (For this reason, in an 
arbitrary ficld F we denote by —a the element such that a + (—a) = 0, and by 
a7! the element such that a- a7! = 1, for a £0.) 

In addition to Q, R, and C, there are several other fields which can be described 
easily. One example is the collection Fy of all numbers a + bV2 for a, b in Q. 
The operations + and « will, once again, be the usual + and - for real numbers. 
It is necessary to point out that these operations really do produce new elements 
of i: 


(a + bJ/2) +(c +dVJ2) =(a+c)+(b Beye). which is in F}; 
(a+ bw oe “(c+ df) = (ac + 2bd) + (be + ad)V2, which is in F). 


Conditions (1), (3), (4), (6), (7) for a field are obvious for F;: since these hold for 
all real numbers, they certainly hold for all real numbers of the form a + bv2. 
Condition (2) holds because the number 0 = 0+0V2 isin F) and, for a = a+bV2 
in F) the number B = (—a) + (—b)/ 2 in F, satishes a + B = 0. Similarly, 
1 = 1+40¥V2 is in F;, so (5i) is satisfied. The verification of (5ii) is the only slightly 
dificult point. If a + bJ/2 #0, then 


] 
Gb). == 3 |: 
atby2 


it is therefore necessary to show that I/(a + bV2) is in Fy. This is true because 


| _ imine 7 a n (—b) We) 
a4 bV2 We = V2 \ia-+bV/2) at = 20") ae 2b 


(The division by a — bV2 is valid because the relation a — bV2 = 0 could be true 


only if a = b = 0 (since V2 is irrational) which is ruled out by the hypothesis 
a+tbV/2 =1().) 

The next example ofa field, 2, is considerably simpler in one respect: it con- 
tains only two elements, which we might as well denote by 0 and 1. ‘The operations 
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+ and « are described by the following tables. 


0 1 ° 0 1 
0 1 0 0 
1 1 0 1 0 1 


The verification of conditions (1)-(7) are straightforward, case-by-case checks. For 
example, condition (1) may be proved by checking the 8 equations obtained by 
setting a, b, c = 0 or 1. Notice that in this field 1 + 1 = 0; this equation may also 
be written 1 = —1. 

Our final example of a field is rather silly: F3 consists of all pairs (a,a) for a 


in R, and + and ° are defined by 


(a,a) + (bb) =(a+b,at+b), 
(a,a)*(b,b) =(a-b,a-b). 


(The + and - appearing on the right side are ordiary addition and multiplication 
for R.) The verification that F3 is a field is left to you as a simple exercise. 

A detailed investigation of the properties of fields is a study in itself, but for our 
purposes, fields provide an ideal framework in which to discuss the properties of 
numbers in the most economical way. For example, the consequences of P1—P9 
which were derived for “numbers” in Chapter | actually hold for any field; in 
particular, they are true for the fields Q, R, and C. 

Notice that certain common properties of Q, R, and C do not hold for all fields. 
For example, it is possible for the equation 1 + 1 = 0 to hold in some fields, and 
consequently a — b = b —a does not necessarily imply that a = b. For the field 
C the assertion 1 + 1 4 0 was derived from the explicit description of C; for the 
fields Q and R, however, this assertion was derived from further properties which 
do not have analogues in the conditions for a field. There is a related concept 
which does use these properties. An ordered field is a field F (with operations + 
and +) together with a certain subset P of F (the “positive” elements) with the 
following properties: 


(8) For all a in F, one and only one of the following 1s true: 
ied — 0, 
(it) a isin P, 
(in) —a is in P. 
(9) If a and b are in P, then a + bis in P. 
(10) If a and b are in P, then a+ bits in P. 


We have already seen that the ficld G cannot be made into an ordered field. 
The field Fy, with only two elements, likewise cannot be made into an ordered 
field: in fact, condition (8), applied to 1 = —1, shows that 1 must be in P; then (9) 
imphes that 1 + 1 = 0 is in P, contradicting (8). On the other hand, the field Fi, 


consisting of all numbers a + bV2 with a.b in Q, certainly can be made into 
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an ordered field: let P be the set of all a + bV2 which are positive real numbers 
(in the ordinary sense). ‘The field 3 can also be made into an ordered field; the 
description of P is left to you. 

It is natural to introduce notation for an arbitrary ordered field which corre- 
sponds to that used for Q and R: we define 


a>b if a—bismP, 
a<b fl ba, 

leSeb i a= b or a— Dd, 
i> be it “ae Dora — v0. 


Using these definitions we can reproduce, for an arbitrary ordered field F, the 
definitions of Chapter 7: 


A set A of elements of F is bounded above if there is some x in F such 
that x 2 a for all a in A. Any such x ts called an upper bound for A. An 
element x of F is a least upper bound for A if x is an upper bound for A 
and x < y for every y in F which ts an upper bound for A. 


Finally, it is possible to state an analogue of property P13 for R; this leads to the 
last abstraction of this chapter: 


A complete ordered field is an ordered field in which every nonempty set 
which is bounded above has a least upper bound. 


The consideration of fields may seem to have taken us far from the goal of 
constructing the real numbers. However, we are now provided with an intelligible 
means of formulating this goal. ‘There are two questions which will be answered 
in the remaining two chapters: 


1. Is there a complete ordered field? 
2. Is there only one complete ordered field? 


Our starting point for these considerations will be Q, assumed to be an or- 
dered field, containing N and Z as certain subsets. At one crucial point it will be 
necessary to assume another fact about Q: 


Let x be an element of Q with x > 0. Then for any y in Q there is some n 
in N such that mx > y. 


This assumption, which asserts that the rational numbers have the Archimedean 
property of the real numbers, does not follow from the other properties of an 
ordered field (for the example that denionstrates this conclusively see reference [14] 
of the Suggested Reading). ‘The important point for us is that when Q is explicitly 
constructed, properties PI-P12 appear as theorems, and so does this additional 
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assumption; if we really began from the beginning, no assumptions about Q would 
be necessary. 


PROBLEMS 


1. Let F be the set {0, 1, 2} and define operations + and + on F by the following 
tables. (Lhe rule for constructing these tables is as follows: add or multiply 
in the usual way, and then subtract the highest possible multiple of 3; thus 


ee — oso 2 42 =I.) 
+ 0 l 2 : 0 ] 2 


Show that F is a field, and prove that it cannot be made into an ordered 


field. 


2. Suppose now that we try to construct a field F having elements 0, 1, 
2, 3 with operations + and + defined as in the previous example, by adding 
or multiplying in the usual way, and then subtracting the highest possible 
multiple of 4. Show that F will not be a field. 


3. Let F = {0,1, a, B} and define operations + and + on F by the following 


tables. 
+ 0 l a B : 0 | a p 
0 0 
] | 
64 a 
p p 


Show that F is a field. 


4. (a) Let F bea field in which 1 + 1 = 0. Show that a + a = 0 for all a (this 
can also be written a = —a). 
(b) Suppose that a + a = 0 for some a 4 0. Show that 1 + 1 = 0 (and 
consequently b + b = 0 for all b). 
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10. 


(a) Show that im any field we have 


(+---+1)-( 4-41 =14+---41 
—__— OO Ore" —_—_—_—_—_—_——_——— 
m times n times min times 


for all natural numbers 7 and n. 
(b) Suppose that m the field F we have 


— 
m times 


for some natural number m. Show that the smallest m with this property 
must be a prime number (this prime number is called the characteristic 
of F). 


Let F be any field with only finitely many elements. 
(a) Show that there must be distinct natural numbers m and 7 with 


1+---+1a=14+.---4+1. 
—— a 
m times n times 


(b) Conclude that there is some natural number k with 


1+---+1=0. 
lit 
k times 
Let a, b, c, and d be elements of a field F with aed —b-+c 40. Show that 
for any @ and f in F the equations 


Gs Ke 
cox td-y= Ff, 


can be solved for x and y in F. 


Let a be an clement of a field F. A “square root” of a is an element b of F 
. 3) 
with b&§ =b-eb=a. 


(a) How many square roots does 0 have? 
(b) Suppose a 4 0. Show that if a@ has a square root, then it has two square 
roots, unless 1 + 1 = 0, in which case a has only one. 


(a) Consider an equation x7 + b+ x +c = 0, where b and ¢ are elements of 
a field F. Suppose that b? = 4+c¢ has a square root r in F. Show that 
(—b+r)/21s a solution of this equation. (Here 2 = 141 and 4 = 242.) 

(b) In the field Fy of the text, both elements clearly have a square root. 
On the other hand, it is easy to check that neither clement satisfies the 
equation x7 x #1 = 0. Thus some detail in part (a) must be incorrect. 
What is it? 

Let F be a field and a an element of F which does not have a square root. 


This problem shows how to construct a bigger field F’, containing F, in 
which a does have a square root. (‘Vhis construction has already been carried 


11. 
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through in a special case, namely, F = R and a = —1; this special case should 
guide you through this example.) 

Let F’ consist of all pairs (x, y) with x and y in F. If the operations on F 
are + and +, define operations @ and © on F’ as follows: 


(x.y) @ (z,wW) = (X $2.9 Fw), 

(x, Vv) O(z,wW) = (Xe ZT #aeyvewW, yer eX w). 
(a) Prove that F’, with the operations ® and ©, 1s a field. 
(b) Prove that 


(x, 0) @ (vy, 0) = (x + y, 0), 
(x, 0) © (y, 0) = (v« y, 0), 


so that we may agree to abbreviate (7,0) by x. 
(c) Find a square root of a = (a, 0) in F’. 


Let F be the set of all four-tuples (w, x,y,z) of real numbers. Define + 
and + by 


(5,0, u,v) + (WX, y, Z) =(8 + wt +4,u+ yy. +2), 


CSF 42, U)i© (CW, X, V5 2) = (SW. SE — UY — VSS EW a SYS 


SY:-F Uw -- 0X — £282 Vw LY — Ux): 


(a) Show that F satisfies all conditions for a field, except (6). At times the 
algebra will become quite ornate, but the existence of multiplicative in- 
verses is the only point requiring any thought. 

(b) It is customary to denote 


(O, 1, 0,0) by i, 
(O, 0, 1,0) by J, 
(0, 0,0, 1) by k. 


Find all 9 products of pairs ¢, 7, and k. The results will show in particular 
that condition (6) is definitely false. ‘This “skew field” F is known as the 
quaternions. 


CHAPTER 


CONSTRUCTION OF THE 
REAL NUMBERS 


The mass of drudgery which this chapter necessarily contais is relieved by one 
truly first-rate idea. In order to prove that a complete ordered field exists we will 
have to explicitly describe one in detail; verifying conditions (1)—(10) for an ordered 
field will be a straightforward ordeal, but the description of the field itself, of the 
elements in it, Is Ingenious indeed. 

At our disposal is the set of rational numbers, and from this raw material it is 
necessary to produce the field which will ultimately be called the real numbers. 
‘To the unmitiated this must seem utterly hopeless if only the rational numbers 
are known, where are the others to come from? By now we have had enough 
experience to realize that the situation may not be quite so hopeless as that casual 
consideration suggests. ‘The strategy to be adopted in our construction has already 
been used effectively for defining functions and complex numbers. Instead of 
trying to determine the “real nature” of these concepts, we settled for a definition 
that described enough about them to determine their mathematical properties 
completely. 

A similar proposal for defining real numbers requires a description of real num- 
bers in terms of rational numbers. ‘The observation, that a real number ought to 
be determined completely by the set of rational numbers less than it, suggests a 
strikingly simple and quite attractive possibility: a real number might (and in fact 
eventually will) be described as a collection of rational numbers. In order to make 
this proposal effective, however, some means must be found for describing “the 
set of rational numbers less than a real number” without mentioning real num- 
bers, which are still nothing more than heuristic figments of our mathematical 
imagination. 

If A is to be regarded as the set of rational numbers which are less than the 
real number a, then A ought to have the following property: If x is in A and y 
is a rational number satisfying y < x, then y is in A. In addition to this property, 
the set A should have a few others. Since there should be some rational number 
x <a, the set A should not be empty. Likewise, since there should be some 
rauonal number x > a, the set A should not be all of Q. Finally, if x < a, then 
there should be another rational number y with x < y < a@, so A should not 
contain a greatest member. 

If we temporarily regard the real numbers as known, then it is not hard to 
check (Problem 8-17) that a set A with these properties is indeed the set of rational 
numbers less than some real number @. Since the real numbers are presently 
in limbo, your proof, if you supply one, must be regarded only as an unofficial 
comment on these proceedings. It will serve to convince you, however, that we 
have uot failed to notice any crucial property of the set A. ‘There appears to be 
no reason for hesitating any longer. 
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DEFINITION A real number is a set @, of rational numbers, with the following four proper- 


is some y nq@ with y > x. 


‘The set of all real numbers is denoted by R. 


Just to remind you of the philosophy behind our definition, here is an explicit 
example of a real number: 


i (he yo) ae eye a 


It should be clear that @ is the real number which will eventually be known as wih 
but it is not an entirely trivial exercise to show that @ actually is a real number. 
The whole pomt of such an exercise is to prove this using only facts about Q; 
the hard part will be checking condition (4), but this has already appeared as a 
problem in a previous chapter (finding out which one is up to you). Notice that 
condition (4), although quite bothersome here, is really essential in order to avoid 
ambiguity; without it both 


and 
{x un@ = x <7} 


would be candidates for the “real number 1.” 

The shift from A to @ in our definition indicates both a conceptual and a no- 
tational concern. Henceforth, a real number us, by definition, a set of rational 
numbers. ‘This means, in particular, that a rational number (a member of Q) 
is not a real number; instead every rational number x has a natural counterpart 
which is a real number, namely, {y in Q : y < x}. After completing the construc- 
tion of the real numbers, we can mentally throw away the elements of Q and 
agree that Q will henceforth denote these special sets. For the moment, however, 
it will be necessary to work at the same time with rational numbers, real numbers 
(sets of rational numbers) and even sets of real numbers (sets of sets of rational 
numbers). Some confusion is perhaps inevitable, but proper notation should keep 
this toa minimum. Rational numbers will be denoted by lower case Roman letters 
(x, y, z, a, b, ce) and real numbers by lower case Greek letters (a, B, y); capital 
Roman letters (A, B, C) will be used to denote sets of real numbers. 

The remainder of this chapter is devoted to the definition of +, +, and P for R, 
and a proof that with these structures R is indeed a complete ordered field. 

We shall actually begin with the definition of P, and even here we shall work 
backwards. We first define a < B; later, when +, +, and 0 are available, we shall 
define P as the set of all w with 0 < a, and prove the necessary properties for P. 
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THEOREM 


PROOF 


he reason for beginning with the definition of < is the simplicity of this concept 
in our present setup: 


Definition. If a and B are real numbers, then w < £ means that @ is contained in 
B (that is, every element of @ is also an element of f), but a # f. 


A repetition of the definitions of S$, >, 2 would be stultifying, but it is interesting 
to note that S$ can now be expressed more simply than <; if @ and f are real 
numbers, then aw S f if and only if @ is contamed in B. 

If A is a bounded collection of real numbers, it is almost obvious that A should 
have a least upper bound. Each @ in A is a collection of rational numbers; if these 
rational numbers are all put in one collection B, then B is presumably sup A. In 
the proof of the followmeg theorem we check all the little details which have not 
been mentioned, not least of which is the assertion that 6 7s a real number. (We 
will not bother numbering theorems in this chapter, since they all add up to one 
big ‘Pheorem: There is a complete ordered field.) 


If A is a set of real numbers and A ¥ # and A is bounded above, then A has a 
least upper bound. 


Let B = {x : x isin some a im A}. Then @ is certaimly a collection of rational 
numbers; the proof that 6 is a real number requires checking four facts. 


(1) Suppose that x is in 6 and y <x. The first condition means that x is in @ 
for some @ in A. Since @ is a real number, the assumption y < x implies 
that y is in @. Therefore it is certainly truce that y is in fp. 

(2) Since A 4 Y, there is some @ in A. Since @ ts a real number, there is some 
¥X it) oe 1 le means that x isiin 6, S0.6 4 V. 

(3) Since A is bounded above, there is some real number y such that a < y 
for every a in A. Since y is a real number, there 1s some rational number 
x which is not in y. Now a < y means that @ is contained in y, so it 1s 
also true that x is not in @ for any a in A. This means that x is not in B; 
sop #Q. 

(4) Suppose that x isin £. Then x is in @ for some a in A. Since @ does not 
have a greatest member, there is some rational number y with x < y and y 
ia. But this means that y is in B; thus B does not have a greatest member. 


These four observations prove that 6 is a real number. The proof that f is the 
least upper bound of A is easier. If @ is in A, then clearly @ is contamed in f; this 
means that a < f, so B is an upper bound for A. On the other hand, if y ts au 
upper bound for A, then a S y for every a@ in A; this means that @ is contained 
in y, for every a in A, and this surely implies that £ is contained in y. 7 Ais, in 
turn, means that B < y; thus £ is the least upper bound of A. ff 


The definition of + is both obvious and easy, but is must be complemented with 
a proof that this “obvious” definition makes any sense at all. 
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Defwtion. Wf a and B are real numbers, then 


a+P={x:x =y+zc for some y inq and some ¢ in f}. 


THEOREM If a and B are real numbers, then a + f is a real number. 


PROOF Once again four facts must be verified. 


(1) 


Suppose w <x for some x ina + 6. Then x = y +z for some y in @ and 
some z in B, which means that w < y + z, and consequently, w — y < z. 
This shows that w — y is in f (since z is in B, and f is a real number). Since 
w= y+(w— y), it follows that w is in a + B. 

It is clear that a + B # V, since a 4% and B FV. 

Since a # Q and f F Q, there are rational numbers a and b with a 
not in @ and b not in BB. Any x in @ satisfies x < a (for if a < x, then 
condition (1) for a real number would imply that a@ is m @); similarly any y 
in B satisfies y < b. Thus x + y < a+b for any x ina@ and yin Bf. This 
shows that a+ bis notina + Bf, sow+p4Q. 

If x is ma + f, then x = y+z for y inq@ and z in Bf. There are y ina 
and z’ in 8 with y < y’ and z < 7; then x < y’+2’' and y’'+Z isina +8. 
Thus aw + £ has no greatest member. J 


By now you can see how tiresome this whole procedure is going to be. Every time 


we mention a new real number, we must prove that it zs a real number; this requires 


checking four conditions, and even when trivial they require concentration. There 


is really no help for this (except that it will be less boring if you check the four 


conditions for yourself). Fortunately, however, a few points of interest will arise 
now and then, and some of our theorems will be easy. In particular, two properties 
of + present no problems. 


THEOREM If a, 8, and y are real numbers, then (@ + 6) 4#y =a+ (P+ yy). 


PROOF Since (x + y)+zZ=*x+(y+2) for all rational numbers x, y, and z, every member 
of (a + B) + y is also a member of a + (6 + y), and vice versa. ff 


THEOREM If a and f are real numbers, then a + 6 = B +a. 


PROOF Left to you (even easier). ff 


‘To prove the other properties of -+ we first define 0. 


Defution. 0 = {x nQ:x < O}. 


It is, thank goodness, obvious that 0 1s a real number, and the followmg theorem 


is also simple. 
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PROOF 


THEOREM 


PROOF 


If @ is a real number, then a +O =a. 


If x ism q@ and y is in 0, then y < 0,so x+y < x. This mplis that x+y is ina. 
Thus every member of a + 0 is also a member of @. 

On the other hand, if x is in @, then there is a rational number y in @ such that 
y > x. Smce x = y+ (« — y), where y is in a, and x — y < 0 (so that x — y is 
in 0), this shows that x is in @ + 0. ‘Vhus every member of @ is also a member 


of a +0. 


The reasonable candidate for —a@ would seem to be the set 
{x m Q : —x is not in a} 


(since —x not in @ means, intuitively, that —x > @, so that x < —q). But in certain 
cases this set will not even be a real number. Although a real number @ does not 
have a greatest member, the set 


Oe — fm O 22> is notin a} 


may have a least element x9; when @ is a real number of this kind, the set 
{x : —x Is not in @} will have a greatest element —xo. It is therefore necessary to 
introduce a slight modification into the definition of —a@, which comes equipped 
with a theorem. 


Definition. If @ is a real number, then 


—a = {x in Q: —x is not ina, but — x is not the least element of Q — a}. 


If @ is a real number, then —a@ ts a real number. 


(1) Suppose thateisin —a@ and y <x. Then —y > —x. Since —x Is no&mec, 
it is also true that —y is not na. Moreover, it is clear that —y 1s not the 
smallest element of Q — a, since —x is a smaller element. This shows that 
yism—aq. 

(2) Since a 4 Q, there is some rational number y which ts not m a@. We can 
assume that y is not the smallest rational number n Q — @ (since y can 
always be replaced by any y’ > y). Then —y is in —a. Thus -—a@ ¥ @. 

(3) Since a # Y, there is some x ina. Then —x cannot possibly be in =a, so 
—«7#Q. 

(4) If x is im —a@, then —x is not in a, and there is a rational number y < —x 
which is also not n @. Let z be a ratonal number with y << < —x. Then 
z is also not in q@, and z is clearly not the smallest element of Q — a. So 

—z ism —a. Since —zc > x, this shows that —a@ does not have a greatest 

element. ff 


The proof that a + (-—a@) = 0 1s not entirely straightforward. Vhe difficulties 
are not caused, as you might presume, by the finicky details in the definition 
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of —a. Rather, at this point we require the Archimedean property of Q stated on 
page 584, which does not follow from Pl P12. ‘This property is needed to prove 
the following lemma, which plays a crucial role in the next theorem. 


Let a be a real number, and z a positive rational number. ‘Then there are (Figure |) 
rational numbers x in a, and y not in qa, such that y — x = z. Moreover, we may 
assuine that y is not the smallest element of Q — a. 


Suppose first that z is in a. If the numbers 


Boye 


Bo 2 Ey) ORs 


were all m a, then every rational number would be in @, since every rational num- 
ber w satisfies w < nz for some n, by the additional assumption on page 584. ‘This 
contradicts the fact that @ 1s a real number, so there is some k such that x = kz is 
ina and y = (k+ 1)z is not n a. Clearly y — x = z. 

Moreover, if y happens to be the smallest element of Q — a, let x’ > x be an 
element of a@, and replace x by x’, and y by y + (v’ — x). 

If z is not nq, there is a similar proof, based on the fact that the numbers (—n)z 
cannot all fail to be in a. J 


> 
“a 


——$$ 
> | 
FIGURE 1 x » 
If aw is a real number, then 
a + (—a) = 0. 


Suppose x is in @ and y isin —a@. Then —y is not m a, so —y > x. Hence 
x+y <0,so x+yisin 0. Thus every member of a + (—a@) 1s in 0. 

It is a little more difficult to go in the other direction. If z is in 0, then —z > 0. 
According to the lemma, there is some x in @, and some y not in @, with y not the 
smallest element of Q — a, such that y — x = —z. This equation can be written 
x+(—y) =<. Since x is in a, and —y is in =a, this proves that z is ina + (—a). 


Before proceeding with multiplication, we define the “positive elements” and 
prove a basic property: 


Definition. P = {a n R: a > O}. 


Notice that a + f is clearly in P if @ and B are. 
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If w is a real number, then one and only one of the following conditions holds: 


(Ue O. 
(i) ais aP, 
(iii) —a@ is in P. 


If @ contains any positive rational number, then @ certainly contains all negative 
rational numbers, so @ contains 0 and aw 4 0, i.e., a isin P. If contains no 
positive rational numbers, then one of two possibilities must hold: 


(1) @ contains all negative rational numbers; then a = 0. 

(2) there is some negative rational number x which is not mo: it can be as- 
sumed that x is not the least element of Q — a (since x could be replaced 
by x/2 > x); then -@ contains the positive rational number —x, so, as we 
have just proved, —a@ is in P. 


This shows that at least one of (i)-(iii) must hold. If a = 0, it is clearly impossible 
for condition (ii) or (iii) to hold. Moreover, it is impossible that a > 0 and —a > 0 
both hold, since this would imply that 0 = a + (—a) > 0. E 


Recall that a > B was defined to mean that @ contains B, but is unequal to p. 
This definition was fine for proving completeness, but now we have to show that 
it is equivalent to the definition which would be made in terms of P. ‘Thus, we 
must show that a — B > 0 is equivalent to a > B. This is clearly a consequence 
of the next theorem. 


If w, B, and y are real numbers and a > f, thna+y>Pp+y. 


The hypothesis « > 6 implies that B is contained in a; it follows immediately from 
the definition of + that B+y is contained ina+y. This shows that a+y 2 B+y. 
We can easily rule out the possibility of equality, for if 


aty=fht+y. 


then 


a=(aty)+(-y) =(B+ty) + (-y) =f. 
which is false. Thusa+y >i +y. 


Multiplication presents difficulties of its own. If a, Bp > 0, then a+ B can be 
defined as follows. 


Definition. \f a and fB are real numbers and a, B > 0, then 


a-B={z:z2<0orz=x-y forsome x in@ and y in B with x, y> 0}. 


If w and f are real numbers with a, B > 0, then a + fp is a real number. 
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PROOF As usual, we must check four conditions. 


(1) Suppose w < c, where z is ina» B. If w < 0, then w is automatically 
in a ¢ B. Suppose that w > 0. Then z > 0, so z = x - y for some positive x 
in @ and positive y in B. Now 


WZ Wxy uw) 
i —| m= DY: 


Since 0 < w < z, we have w/z < 1, so (w/z)-x isin a@. Thus wis ma: B. 

(2) Clearly a - BA 9. 

(3) [fxs not ina, and Wis not im B, then x > x’ for ally’ santa, andes 
for all y’ in B. Hence xy > x’y’ for all such positive x’ and y’. So xy is not 
ina+f;thu:a-Bp4AQ. 

(4) Suppose w is in a+ B, and w < 0. There is some x in @ with x > 0 and 
some y in 6 with y > 0. Then z= xy is n a+ 8 and z > w. Now suppose 
uw > 0. Then w = xy for some positive x m @ and some positive y in B. 
Moreover, @ contains some %—> x: 1f £°S,x'y, then" = yy — w. ana < Is 
in a+. Thus a+ B does not have a greatest element. § 


Notice that @ + B is clearly in P if @ and B are. This completes the verification 
of all properties of P. ‘To complete the definition of + we first define |@|. 


Definition. If a is a real number, then 


Definition. If a and B are real numbers, then 


0, te=0orp=0 
a-p= 4% jaf-[Al. fa>O0,p>Oora <0,6 <0 
—(la]-|A]), itfa>0,8 <Oora <0,f8 >0. 


As one might suspect, the proofs of the properties of multiplication usually m- 
volve reduction to the case of positive numbers. 


THEOREM If a, B, and y are real numbers, then a + (B+ y) = (a+ B)ey. 


prooF ‘This 1s clear if a, B, y > 0. The proof for the general case requires considerme 
separate cases (and is simplified slightly if one uses the following theorem). J 


THEOREM Hf @ and £6 are real numbers, then a + B = Bea. 
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PROOF 


THEOREM 


PROOF 


THEOREM 


PROOF 


LEMMA 


This is clear if a, B > 0, and the other cases are easily checked. J 


Deron. We fx 1m Ow: x <_1}. 


(It is clear that 1 ts a real number.) 
If w is a real number, then a-l=a. 


Let a > 0. It is easy to see that every member of @ +1 is also a member of a@. 

On the other hand, suppose x is ina. If x < 0, then x is automatically in a «1. 

If x > QO, then there is some rational number y in @ such that x < y. Then 

x =y-(x/y), and x/y isin 1, so x is na@-+1. This proves that a> l=a if a > 0. 
If a <0, then, applying the result just proved, we have 


ael=—(fa]-f1]) = —(a]) =a. 


Finally, the theorem is obvious when a = 0. J 


Defimtion. If a is a real number and @ > 0, then 


a7! = {x in Q : x <0, or x > 0 and I/x is not in a, but 1/x is not the smallest 


member of Q — a}; 


if a <0, then a7! = =—(faf)7!. 


1 


If a is a real number uncqual to 0, then a7 is a real number. 


Clearly it suffices to consider only a > 0. Four conditions must be checked. 


(1) Suppose y <x, and x isin a7). If y < 0, then y isin a7!. If y > 0, then 


x > 0, so'l/x 1s not in @. Since_l/y = |/x, 11 follows that 1/s.1s notm.as 


and I/y is clearly not the smallest element of Q — a, so y is in a7. 


(2) Clearly a=! £ @. 

(3) Since a > O, there is some positive rational number x in a. Then |/x ts 
notin a7!, soa! 4 Q. 

(4) Suppose x is in a7, 

-l 


1 


If x < 0, there is clearly some y nm a@~ with y > x 


because a7 contains some positive rationals. If x > 0, then 1/x is not in a. 
Since 1/x is not the smallest member of Q — a, there is a rational number 
y not in a, with y < I/x. Choose a rational number z with y < ¢ < I/yx. 
Then 1/z is in a7! : 


member. J 


, and 1/z > x. Thus a~ does not contain a largest 


In order to prove that a7! is really the multiplicative inverse of a, it helps to 
have another lemma, which is the multiplicative analogue of our first lemma. 


Let a be a real number with @ > 0, and z a rational number with < > 1. ‘Then 
there are rauonal numbers x in a, aud y not ina, such that y/x = c. Moreover, 
we can assume that y is not the least clement of Q — a. 


PROOF 


THEOREM 


PROOF 


THEOREM 


PROOF 


29. Construction of the Real Numbers 597 


Suppose first that zis in @. Since < — | > O and 
can en oe) cy a: 


it follows that the numbers 


cannot all be ina. So there is some k such that x = <‘ isin @, and y =z 


in a. Clearly y/x =<. Moreover, if y happens to be the least element of Q — a, 
let x’ > x be an element of a, and replace x by x’ and y by yx'/x. 

If z is not in a, there is a similar proof, based on the fact that the numbers 1/2‘ 
cannot all fail to be ina. J 


k+l is not 


If aw is areal number and a £0, then a-a7! = 1. 


It obviously suffices to consider only a > 0, in which case a7! > 0. Suppose that 
X 1s a positive rational number in @, and y ts a positive rational number in aa!, 
Then I/y is not in a, so 1/y > x; consequently xy < 1, which means that xy is 
in 1. Since all rational numbers x < 0 are also in 1, this shows that every member 
of aaa! is in 1. 

To prove the converse ‘assertion, let z be m= 1. If = 0, then®elgariy 2 is"in 
aa, Suppose 0 < z < 1. According to the lemma, there are positive rational 
numbers x in a, and y not ina, such that y/x = 1/z; and we can assume that y 
is not the smallest element of Q — a. But this means that z = x - (1/y), where x 
is mar, and 1/y is ina7!. Consequently, z is ina- a7. § 

We are almost done! Only the proof of the distributive law remains. Once again 
we must consider many cases, but do not despair. ‘The case when all numbers are 
positive contains an interesting point, and the other cases can all be taken care of 


very neatly. 
If a, B, and y are real numbers, thena+(B +y) =a+B+aery. 


Assume first that a, B, y > 0. ‘Then both numbers in the equation contain all 
rational numbers < 0. A positive rational number in @ « (6 + y) is of the form 
x -(y +z) for positive x in a, y in B, and zin y. Since x-(y +z) =x-y+x-z, 
where x + y is a positive element of @ + 6, and x - z is a positive element of ay, 
this number is also in@+ B #aey. Thus, every element of @ + (B + y) is also in 
a-P+ary. 

On the other hand, a positive rational number in a+ B # a+ y ts of the form 
Miewe¥ot x2 * Z for positiv¥e.4],.49 Ina, ¥ in BP, and z in y. If x; = xo, then 
(xy,/ x2)? yy = HysO G/ xo) y is m8. Thus 


ey ap ee en (1) XD) Te 


ismae(B + y). Of course, the same trick works if x2 < x}. 
To complete the proof it is necessary to consider the cases when a, B, and y 
are not all >0. If any one of the three equals 0, the proof is easy and the cases 
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involving a < 0 can be derived immediately once all the possibilities for 6 and 
y have been accounted for. Thus we assume @w > 0 and consider three cases: 
Bb. y <0, and Bp <0, y > 0, and p > 0, y <0. The first follows immediately from 
the case already proved, and the third follows from the second by interchanging B 
and y. ‘Vherefore we concentrate on the case B <0, y > 0. There are then two 
possibihties: 
(1) B+y2O. Then 
a-y=a-([Bty| + [PP =a-(P+y) ta-[fl. 


sO 


a-(P+ty)=—-(a-|pl) tasy 


asB+taery. 


(2)B+y <0. Then 
a? |Bl=a-(B+tvlt+y)=a°[BP+ty| tary, 


sO 


a-(By)=—@-[B+y)=-@-|s)ta-y=a-ptoary.f 


This proof completes the work of the chapter. Although long and frequently 
tedious, this chapter contains results suffictently important to be read in detail at 
least once (and preferably not more than once!). For the first time we know that we 
have not been operating ina vacuum ~ there is indeed a complete ordercd field, the 
theorems of this book are not based on assumptions which can never be realized. 
One interestng and horrid possibility remains: there may be several complete 
ordered fields. If this is true, then the theorems of calculus are unexpectedly rich 
in content, but the properties PI-P13 are disappointingly incomplete. ‘The last 
chapter disposes of this possibility; properties PI-P13 completely characterize the 
real numbers—anything that can be proved about real numbers can be proved on 
the basis of these properties alone. 


PROBLEMS 

There are only two problems in this set, but each asks for an entirely chflerent 
construction of the real numbers! The detailed examination of another construc- 
tion is recommended only for masochists, but the main idea behind these other 
constructions 1s worth knowing. ‘The real numbers constructed in this chapter 
might be called “the algebraist’s real numbers,” since they were purposely defined 
so as to guarantee the least upper bound property, which involves the ordering <, 
an algebraic notion. “Vhe real number system constructed in the next problem 
nnght be called “the analyst’s real numbers,” since they are devised so that Cauchy 
sequences will always converge. 


1. Since every real number ought to be the limit of some Cauchy sequence 
of rational numbers, we night try to define a real number to be a Cauchy 
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sequence of rational numbers. Since two Cauchy sequences might converge 
to the same real number, however, this proposal requires some modifications. 


(a) Define two Cauchy sequences of rational numbers {a,} and {b,} to be 
equivalent (denoted by {a,} ~ {b,}) if lim (a, — by) = 0. Prove that 
lH 


tar| "<“{a yy that bal ea) it fan) —~ 1b) ahd that. {a,)°- [ag ail 
{an} ~ {bn} and {bn} ~ {en}. 

(b) Suppose that a@ ts the set of all sequences equivalent to {a,}, and B is the 
set of all sequences equivalent to {b,}. Prove that either aM B = ¥ or 
a= fp. If aN B £ ¥, then there is some {c,} m both aw and f. Show that 
in this case @ and f both consist precisely of those sequences equivalent 
to {Cn }.) 


Part (b) shows that the collection of all Cauchy sequences can be split up 
into cisjomt sets, each set consisting of all sequences equivalent to some 
fixed sequence. We define a real number to be such a collection, and 
denote the set of all real numbers by R. 

(c) If @ and Bf are real numbers, let {a,} be a sequence in a, and {b,} a 
sequence in B. Define a + B to be the collection of all sequences equiva- 
lent to the sequence {a, +5,}. Show that {a,+,} 1s a Cauchy sequence 
and also show that this definition does not depend on the particular se- 
quences {a,} and {b,} chosen for @ and B. Check also that the analogous 
definition of multiplication is well defined. 

(d) Show that R is a field with these operations; existence of a multiplicative 
inverse is the only interesting point to check. 

(ec) Define the positive real numbers P so that R will be an ordered field. 

(f) Prove that every Cauchy sequence of real numbers converges. Remem- 
ber that if {a} is a sequence of real numbers, then each @, is itself a 
collection of Cauchy sequences of rational numbers. 


This problem outlines a construction of “the high-school student’s real num- 
bers.” We define a real number to be a pair (a, {bn}), where a is an inte- 
ger and {b,} is a sequence of natural numbers from 0 to 9, with the pro- 


viso that the sequence is not eventually 9; intuitively, this pair represents 
CO 


a+ ns 10°". With this definition, a real number is a very concrete ob- 
=| 

ject, but the dificultes mvolved m defining addition and multiplication are 

formidable (how do you add infinite decimals without worrying about car- 

rying digits infinitely far out?). A reasonable approach is outlined below; the 

trick is to use least upper bounds right from the start. 


(a) Define (a, {B,}) < (c, {d,}) if a <e, or if a =e and for some nr we have 
b, <d, but bj =d; for 1 < j <n. Using this definition, prove the least 
upper bound property. 


k 
(b) Giver’ a = (a; (bq), deine a, =a + eal "; mtuitively, ag is the 


n=] 
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rational number obtained by changing all decimal places after the kth 
k 
to 0. Conversely, given a rational number r of the form a+ ) b10-., 
n=1 
let r’ denote the real number (a, {b,'}), where b,’ = b, for 1 <n <k 
and b,' = 0 forn > k. Now for a = (a, {b,}) and B = (c, {d,}) define 


a + B = sup{(a@, + Bj)’ : k a natural number} 
(the least upper bound exists by part (a)). If multiplication is defined 
similarly, then the verification of all conditions for a field is a straight- 
forward task, not highly recommended. Once more, however, existence 
of multiplicative inverses will be the hardest. 


CHAPTER 


UNIQUENESS OF THE REAL NUMBERS 


We shall now revert to the usual notation for real numbers, reserving boldface 
symbols for other fields which may turn up. Moreover, we will regard integers and 
rational numbers as special kinds of real numbers, and forget about the specific 
way in which real numbers were defined. In this chapter we are interested in only 
one question: are there any complete ordered fields other than R? The answer 
to this question, if taken literally, is “yes.” For example, the field F3 introduced in 
Chapter 28 is a complete ordered field, and it is certamly not R. This field is a 
“silly” example because the pair (a, a) can be regarded as just another name for 
the real number a; the operations 


(a,a) + (b,b) =(a+b,at+b), 
(araine (3b) =(a- 0. @ = be 


are consistent with this renaming. This sort of example shows that any intelligent 
consideration of the question requires some mathematical means of discussing such 
renaming procedures. 

If the elements of a field F are going to be used to rename elements of R, then 
for each a in R there should correspond a “name” f(a) in F. The notation f(a) 
suggests that renaming can be formulated in terms of functions. In order to do 
this we will need a concept of function much more general than any which has 
occurred until now; in fact, we will require the most general notion of “function” 
used in mathematics. A function, m this general sense, is simply a rule which 
assigns to some things, other things. ‘To be formal, a function is a collection of 
ordered pairs (of objects of any sort) which does not contain two distinct pairs with 
the same first element. ‘The domain of a function f is the set A of all objects a 
such that (a,b) is in f for some b; this (unique) b is denoted by f(a). If f(a) is 
in the set B for all a in A, then /f is called a function from A to B. For example, 


if f(x) = sin x for all x in R (and / is defined only for x in R), then f is a 
function from R to R; it is also a function from R to [—1. 1]; 


if f(z) =smz¢ for all z in G, then f is a function from C to C; 


if efi(z) = e*.for allezan C,"then f ts a*function from C to €; it is also a 
function from C to {z in GC: z 40}: 


Ghs.a-function ‘from {zineGiog 0} to (x mR: 0 <4 < 27}; 


if f is the collection of all pairs (a, (a,a)) for a in R, then f is a function 
from R to F3. 
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Suppose that Fy and F 2 are two fields; we will denote the operations in Fy by 
®, ©, etc., and the operations in Fy by +, +, etc. If Fy is going to be considered 
as a collection of new names for elements of Fy, then there should be a function 
from F; to Fz with the followmg properties: 


(1) The function f should be one-one, that is, if x 4 y, then we should have 
f(x) # f(y); this means that no two elements of Fy have the same name. 

(2) The function f should be “onto,” that is, for every element z in F> there 
should be some x in Fy such that z = f(x); this means that every element 
of F> is used to name some element of F). 

(3) For all x and y in F; we should have 


fa @Oy)= f@) + fO), 
f 1@m) = f(x) f(y); 


this means that the renaming procedure is consistent with the operations of 


the field. 


If we are also considering Fy and F> as ordered fields, we add one more re- 
quirement: 


(4) | ieee Ry, Metin (X).< | (x). 


A function with these properties 1s called an tsomorphism from Fy to Fy. ‘This 
definition is so important that we restate it formally. 


DEFINITION If F, and Fy) are two fields, an isomorphism from F| to F> 1s a function f 
from F; to F2 with the following properties: 


(1) We Aten (x) Ff (7)! 
(2) If istmeroe then z= f() for some x in Fj. 
(3) If x and y are in Fy, then 


of Geek — 9) Corey), 
fx Oy) = f)+ FQ). 


If Fy and Fy) are ordered fields we also require: 


(4) If x @y, then f(x) < f(y). 


The fields Fy and F> are called isomorphic if there is an isomorphism between 
them. Isomorphic fields may be regarded as essentially the same——any important 
property of one will automatically hold for the other. Therefore, we can, and 
should, reformulate the question asked at the beginning of the chapter; if F is a 
complete ordered field it is silly to expect F to equal R—rather, we would like to 
know if F is isomorphic to R. In the following theorem, F will be a held, with 
operations + and +, and “positive elements” P; we write a < b to mean that b=a 
is in P, and so forth. 


THEOREM 


PROOF 
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If F is a complete ordered field, then F is isomorphic to R. 


Since two fields are defined to be isomorphic if there is an isomorphism between 
them, we must actually construct a function f from R to F which is an isomor- 
phism. We begin by defining f on the integers as follows: 


f(O) = 0, 
fmal+...4+1 form > 0; 
: 
n times 
fay =—(1+...4+1) for <0. 
ee! 
|7| times 


It is easy to check that 


f(m+n)= fim) + f(n), 
f(m-n) = f(m) + fn), 


for all integers m and n, and it is convenient to denote f(n) by n. We then 
define f on the rational numbers by 


f(m/n) =m[n=m- n7} 

(notice that the n-fold sum 1 +--- +140 if n > 0, since F its an ordered field). 
This definition makes sense because if m/n = k/1, then ml = nk,so mel = ken, so 
m-n7!=k-I7!, It is easy to check that 


f@ +re) = fri) & f(r), 
pri 12) — f(r) fa). 


for all rational numbers r; and rp, and that f(r) < f(r2) if ry < ro. 

The definition of f(x) for arbitrary x is based on the now familiar idea that 
any real number is determined by the rational numbers less than it. For any x 
in R, let A, be the subset of F consisting of all f(r), for all rational numbers 
r <x. The set A, is certainly not empty, and it is also bounded above, for if ro 
is a rational number with rg > x, then f(ro) > f(r) for all f(r) in Ay. Since F 
is a complete ordered field, the set Ay has a least upper bound; we define f(x) as 
sup Ax. 

We now have f(x) defined in two different ways, first for rational x, and then for 
any x. Before proceeding further, it is necessary to show that these two definitions 
agree for rational x. In other words, if x is a rational number, we want to show 
that 


sup Ay = f(x), 


where f(x) here denotes m/n, for x = m/n. ‘This is not automatic, but depends 
on the completeness of F; a slight digression is thus required. 
Since F is complete, the elements 


1+...4+1 for natural numbers n 
ne 


n times 
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form a set which is not bounded above; the proof is exactly the same as the proof 
for R (Theorem 8-2). The consequences of this fact for R have exact analogues 
in F: in particular, if a and b are elements of F with a < b, then there is a rational 
number r such that 


Ga (r) =< dD. 


Having made this observation, we return to the proof that the two definitions 
of f(x) agree for rational x. If y ts a rational number with y < x, then we 
have already seen that f(y) < f(x). Thus every element of A, is < f(x). 
Consequently, 

sup Ax S f(x). 


On the other hand, suppose that we had 
sup Ay < / (2)! 
Then there would be a rational number r such that 
supAy < J (7) < 7). 


But the condition f(r) < f(+*) means that r < x, which means that f(r) is in the 
set Ax; this clearly contradicts the condition sup Ay < f(r). This shows that the 
original assumption is false, so 


Supe fale): 


We thus have a certain well-defined function f from R to F. In order to show 
that f is an isomorphism we must verify conditions (1)—(4) of the definition. We 
will begin with (4). 

If x and y are real numbers with x < y, then clearly A, 1s contained in Ay. 
Thus 

f(x) = sup A, < supA, = f(y). 
To rule out the possibility of equality, notice that there are rational numbers r 


and s with 
Xe ieee 


We know that f(r) < f(s). It follows that 
LAX) Sf =< fies fOr 


This proves (4). 

Condition (1) follows immediately from (4): If x 4 y, then either x < y or 
y < x; in the first case f(x) < f(y), and in the second case f(y) < f(x); m 
either case f(x) ==5) ik 

To prove (2), let a be an element of F, and let B be the set of all rational 
numbers r with f(r) <a. The set B is not empty, and it is also bounded above, 
because there is a rational number s with f(s) > a, so that f(s) > f(r) for r 
in B, which imphes that s > r. Let x be the least upper bound of B; we claim 
that f(x) =a. In order to prove this it suffices to eliminate the alternatives 


if O), 4.4; 
a< f Ge: 
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In the first case there would be a rational number r with 


f(x) < f(r) <a. 


But this means that x < r and that r is in B, which contradicts the fact that 
x = sup B. In the second case there would be a rational number r with 


a@<= f(r) < J). 


This implies that r < x. Since x = sup B, this means that r < s for some s in B. 
Hence 


fir) < f(s) <a, 


again a contradiction. Thus f(x) = a, proving (2). 
To check (3), let x and y be real numbers and suppose that f(x + y) # 
FO) + @)o Then either 


fatty) < f@) + ff) or CO +4 7G) <=7C =)! 
In the first case there would be a rational number r such that 


Pier) <r Cero): 


But this would mean that 
if ae) Se 


Therefore r could be written as the sum of two rational numbers 
b= tet ro: where x < rj and y <r. 
Then, using the facts checked about f for rational numbers, it would follow that 
PO) = fi +n) = fen By Gs) = 7 Oe). 


a contradiction. The other case is handled similarly. 
Finally, if x and y are positive real numbers, the same sort of reasoning shows 
that 


IC gam C6): 


the general case is then a simple consequence. J 


This theorem brings to an end our investigation of the real numbers, and resolves 
any doubts about them: ‘There 7s a complete ordered field and, up to isomorphism, 
only one complete ordercd ficld. It is an important part of a mathematical educa- 
tion to follow a construction of the real numbers in detail, but it is not necessary 
to refer ever again to this particular construction. It is utterly irrelevant that a real 
number happens to be a collection of rational numbers, and such a fact should 
never enter the proof of any important theorem about the real numbers. Reason- 
able proofs should use only the fact that the real numbers are a complete ordered 
ficld, because this property of the real numbers characterizes them up to isomor- 
phism, and any significant mathematical property of the real numbers will be true 
for all isomorphic ficlds. ‘To be candid I should admit that this last assertion is just 
a prejudice of the author, but it is one shared by almost all other mathematicians. 
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PROBLEMS 


IG 


Let f be an isomorphism from F; to F>. 


(a) Show that ((0) = 0 and f(1) = 1. (Here 0 and 1 on the left denote 
elements i Fy, while 0 and 1 on the right denote elements of F.) 


(b) Show that f(—a) = —f(a) and f(a7!) = f(a)7!, for a 4 0. 


Here is an opportunity to convince yourself that any significant property of 
a field is shared by any field isomorphic to it. The point of this problem is 
to write out very formal proofs until you are certain that all statements of 
this sort are obvious. Fy and F2 will be two fields which are isomorphic; for 
simplicity we will denote the operations in both by + and +. Show that: 


(a) If the equation x7 + 1 = 0 has a solution in Fy, then it has a solution 
in Fp. 

(b) If every polynomial equation x” + a,—, ¢ x"! + +++ ag = 0 with 
dQ, ---,@,-1 m F,, has a root in Fj, then every polynomial equation 
x" tb, 1 ox"! +--+ bp ='0 with bo, .... b,—) in F> has a root in Fy. 


c) If 1 +---+ 1 (summed m times) = 0 in Fj, then the same is true in F). 
\ Ba 
(d) If Fy and F> are ordered fields (and the isomorphism /f satisfies f(x) < 


f(y) for x < y) and F; is complete, then F is complete. 


Let f be an isomorphism from F, to Fy and g an isomorphism from F 
to F3. Define the function go f from F; to F3 by (g 0 f)(x) = g(f(x)). 
Show that go f is an isomorphism. 


Suppose that F is a complete ordered field, so that there is an isomorphism f 
from R to F. Show that there is actually only one isomorphism from R 
to F. Hint: In case F = R, this is Problem 3-17. Now if f and g are two 
isomorphisms from R to F consider g-! o f. 


Find an isomorphism from C to C other than the identity function. 
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A man ought to read 

just-as inclination leads him; 
Sor what he reads as a task 
will do him little good. 


SAMUEL JOHNSON 


One purpose of this bibliography is to guide the reader to other sources, but the 
most important function it can serve is to indicate the variety of mathematical 
reading available. Consequently, there is an attempt to achieve diversity, but no 
pretense of being complete. ‘Vhe present plethora of mathematics books would 
make such an undertaking almost hopeless in any case, and since I have tried to 
encourage independent reading, the more standard a text, the less likely it is to 
appear here. In some cases, this philosophy may seem to have been carried to 
extremes, as some entries in the list cannot be read by a student just finishing a 
first course of calculus until several years have elapsed. Nevertheless, there are 
many selections which can be read now, and I can’t believe that it hurts to have 
some idea of what lies ahead. 

kor most references, only the title and author have been given, since so many of 
these books have gone through numerous editions and printings, often having gone 
out of print at some pomt only to be resurrected later on by a different publisher 
(often as an inexpensive paperback by the redoubtable Dover Publications or by 
the Mathematical Association of America). More exact information really isn’t 
necessary, since it is Now so easy to search for books on-line at Amazon.com and 
other sites. 

+ is used to indicate books whose availability, either new or used, is problematical. 
Author and title searches may turn up other intriguing books by the same author, 
or other books with similar titles. In addition, many of these books will still be 
found in well-stocked academic hbraries, perhaps the best place of all to search; 
despite the convenience of the internet, nothing matches the experience of an 
actual (as opposed to a virtual) hbrary, with books stacked according to subject, 
awaiting serendipitous discovery. 


One of the most elementary unproved theorems mentioned in this book is the 
“Fundamental Theorem of Arithmetic”, that every natural number can be written 
as a product of primes in only one way. This follows from the basic fact alluded 
to on page 444, a proof of which will be found near the beginning of almost 
any book on elementary number theory. Few books have won so enthusiastic an 
audience as 


[1] An Introduction to the Theory of Numbers, by G. H. Hardy and E. M. Wright. 
‘Two other recommended books are 


+ [2] Al Selection of Problems in the Theory of Numbers, by WW. Sierpinski. 
[3] Three Pearls of Number Theory, by A. Khinchin. 


The Fundamental ‘Theorem also apphes in more general algebraic settings, see 


references [33] and [34]. 


The subject of irrational numbers straddles the fields of number theory and 
analysis. An excellent introduction will be found in 


[4]  /rrational Numbers, by 1. M. Niven. 
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‘Together with many historical notes, there are references to some fairly elementary 
arucles m journals. There is also a proof that 7 is transcendental (see also [59]) 
and, finally, a proof of the “Gelfond-Schneider theorem”: If a and b are algebraic, 
with a 4 0 or 1, and b is irrational, then a? is transcendental. 


All the books listed so far begin with natural numbers, but whenever necessary 
take for granted the irrational numbers, not to mention the integers and rational 
numbers. Several books present a construction of the rational numbers from the 
natural numbers, but one of the most lucid treatments is still to be found in 


[5] Foundations of Analysis, by E. Landau. 
While many mathematicians are content to accept the natural numbers as a nat- 
ural starting point, numbers can be defined m terms of sets, the most basic starting 


point of all. A charming exposition of set theory can be found in a sophisticated 
little book called 


[6] .Nawe Set Theory, by P. R. Halmos. 
Another very good introduction 1s 
[7] Theory of Sets, by E. Kamke. 


Perhaps it is necessary to assure some victims of the “new math” that set theory 
does have some mathematical content (in fact, some very deep theorems). Using 
these deep results, Kamke proves that there is a discontinuous function f such 
that f(x + y) = f(x) + f(y) for all x and y. 


Inequalities, which were treated as an elementary topic m Chapters | and 2, 
actually form a specialized field. A good elementary mtroduction its provided by 


[8] Analytic Inequalities, by N. Kazarinoff. 


Twelve different proofs that the geometric mean is less than or equal to the arith- 
metic mean, each based on a different principle, can be found in the beginning of 
the more advanced book 


[9] An Introduction to Inequahtes, by F. Beckenbach and R. Bellman. 
The classic work on mequalities is 
[10] /nequahties, by G. H. Hardy, J. E. Littlewood, and G. Polya. 
Each of the authors of this triple collaboration has provided his own contribution 


to the sparse literature about the nature of mathematical thinking, written from a 
mathematician’s pomt of view. My favorite is 


[11] ot Afathematicran’s Apology, by G. H. Hardy. 
Litthewood’s anecdotal selections are enutled 


+ [12] al Mathematicran’s Miscellany, by J. 12. Litthewood. 
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Polya’s contribution is pedagogy at the highest level: 


[13] Wathematics and Plausible Reasoning (Vol. 1: Induction and Analogy in Mathematics: 
+ Vol. I: Patterns of Plausible Inference), by G. Polya. 


Geometry ts the other main field which can be considered as background for 
calculus. Vhough Euchid’s Elements is stll a masterful mathematical work, greater 
perspective is supplied by some more modern texts, which examine foundational 
questions, non-Euchdean geometry, the role of the “Archimedean axiom” in geom- 
etry, and further results from “classical geometry”. Of the following three books, 
the first, listed m previous editions of this book, has probably been supplanted by 
the later ones, which cover some more advanced material, and perhaps require a 
little more sophistication on the part of the reader. 


+ [14] Elementary Geometry from an Advanced Standpoint, by E. Moise. 
[15] Huchdean and Non-Eucldean Geometnes, by M. J. Greenberg. 
[16] Geometry: Euchd and Beyond, by R. Hartshorne. 


In addition, all sorts of fascinating geometric things can be found in 
[17] Introduction to Geometry, by H. S. Coxeter. 
Almost all treatments of geometry at least mention convexity, which forms an- 


other specialized topic. I cannot imagine a better introduction to convexity, or a 
better mathematical experience m general, than reading and working through 


T [18] Conver Figures, by I. M. Yaglom and W. G, Boltyanski. 


This book contams a carefully arranged sequence of definitions and statements of 
theorems, whose proofs are to be supphed by the reader (worked-out proofs are 
supplied in the back of the book). Its current unavailability is perhaps a testament 
to the lack of interest in working through exercises, which might also apply to 
another geometry book modeled on the same principle: 


+ [19] Combinatorial Geometry in the Plane, by H. Hadwiger and H. Debrunner. 


Along with these two out-of-the-ordinary books, I might mention an extremely 
valuable little book, also of a specialized sort, 


[20] Counterexamples in Analysis, by B. Gelbaum and J. Olmsted. 
Many of the example in this book come from more advanced topics in analysis, 
but quite a few can be appreciated by someone who knows calculus. 

Of the infinitude of calculus books, two are considered classics: 


1] A Course of Pure Mathemates, by G. H. Hardy. 
2| Differential and Integral Calculus (two volumes), by R. Courant. 
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Courant is especially strong on applications to physics. ‘There is also a more mod- 
ern update 


[23] Introduction to Calculus and Analysis, by R. Courant and F John. 


Speaking of applications to physics, an elegant exposition of the material in 
Chapter 17, together with much further discussion, can be found in the article 


[24] On the geometry of the hepler problem, by John Milnor; in The American Mathe- 
matical Monthly, Volume 90 (1983), pp. 353-365. 


(In this paper the curve c’ of Chapter 17 is denoted by v. and the derivative of 
the important composition vo 67! (page 334) is introduced quite off-handedly as 
dv/d6é.) A “straight-forward” derivation of Kepler’s laws, together with numerous 
references, can be found in another article in this same journal, 
[25] The mathematical relationship between hepler’s laws and Newton’s laws, by 
Andrew ‘Tf. Hyman; in The American Mathematical Monthly, Volume 100 


Cie Sa 2-9 36° 


The later parts of Volume I of Courant contain material usually found m ad- 
vanced calculus, including differential equations and Fourier series. An introduc- 
tion to Fourier series (requiring a little advanced calculus) will also be found in 


[26] An Introduction to Fourier Series and Integrals, by R. Seeley. 


The second volume of Courant (advanced calculus in earnest) contains addi- 
tional material on differential equations, as well as an introduction to the calculus 
of variations. A widely admired book on differential equations 1s 


[27] Lectures on Ordinary Differential Equations, by W. Hurewicz. 
A good example of new approaches and new topics is provided by 
[28] Differential Equations, Dynamical Systems, and An Introduction to Chaos, 
by M. Hirsch, 8. Smale, and R. L. Devaney. 


I will bypass the more or less standard advanced calculus books (which can easily 
be found by the reader) since nowadays the presentation of advanced calculus for 
mathematics students is based upon linear algebra. One of the first treatments of 
advanced calculus using linear algebra is the very nice book 


+ [29] Calculus of Vector Functions, by R. H. Crowell and R. E. Willhamson. 
More recent books to be recommended are 


[30] Advanced Calculus of Several Vanables, by C. H. Edwards, Jr. 
[31] Afultivariable Mathematics, by ‘V. Shifrin. 


And of course 1 am still partial to an older text 


[32] Calculus on Manifolds, by M. Spivak. 
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There are three other topics which are somewhat out of place in this bibli- 
ography because they are gradually becoming established as part of a standard 
undergraduate curriculum. ‘The purposeful study of fields and related systems is 
the domain of “algebra.” Two excellent texts are 


[33] Algebra, by Michael Artin. 
[34] Abstract Algebra, by D. Dummit and R. Foote. 


For “complex analysis”, the promised land of Chapter 27, the classical text is 
[35] Complex Analysis, by L. Ahlfors. 


Rather revolutionary when it was first published, it might now be considered some- 
what old-fashioned, and you might prefer the second m a series of books (3 and 
counting) that have appeared more recently: 


[36] Fourer Analysis: An Introduction, by E. Stein and R. Shakarchi. 
[37] Complex Analysis, by E. Stein and R. Shakarchi. 
[38] Real Analysis, by E. Stem and R. Shakarchi. 


And, since the topic of “real analysis” [high-octane Calculus] has been broached, 
two classics should be mentioned. The first, affectionately known as “baby Rudin”, 
was the source of several problems that appear in this book. 


[39] Principles of Mathematical Analysis, by W. Rudin. 
[40] Functional Analysis, by \W. Rudin. 


The subject of “topology” has not been mentioned before, but it has really been 
in the background of many discussions, since it is the natural generalization of the 
ideas about limits and continuity which play such a prominent role in Part II of 
this book. ‘The standard text is now 


[41] Topology, by J. R. Munkres. 
For the related field of “differential topology”, see 


[42] Differential Topology, by V. Guillemin and A. Pollack. 


The next few topics, ranging from elementary to very difficult, are included 
in this bibhography because they have been alluded to in the text. he ganima 
function has an elegant little book devoted entirely to its properties, most of them 
proved by using the theorem of Bohr and Mollerup which was mentioned in Prob- 


lem 19-40: 
+ [43] The Gamma Function, by E. Artin 


The gamma function 1s only one of several important improper integrals in math- 
< 5 . 5 ioe! oa Nes 
ematics. In particular, the calculation of f)~ e~* dx (see Problem 19-42) is impor- 
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tant m probability theory, where the “normal distribution function” 


O(x) = i" dy 


l 
=| « 
21 J—co 
plays a fundamental role. A classic book on probability theory 1s 


[44] An Introduction to Probability Theory and Its Applications, by W. Feller. 


The impossibility of integrating certain functions in elementary terms (among 
them) (UJE= e-*) is a fairly esoteric topic. A discussion of the possibilities of 
integrating in elementary terms, with an outline of the impossibility proofs, and 
references to the original papers of Liouville, will be found in 


[45] The Integration of Functions of a Single Variable, by G. H. Hardy. 


The basically algebraic ideas behind the arguments were made much clearer over 
a hundred years after Liouville’s work, in the paper 


[46] On Liousville’s Theorem on functions with elementary integrals, by M. Rosenlicht; m 
Pacific Journal of Mathematics, Volume 24, No. 1 (1968), pp. 153-161. (Also 


available on-line: go to projecteuclid.org and search for Rosenlicht.) 
For a good overview of the subject, and some more recent developments, see 


[47] Integration in finite terms: the Liousville theory, by ‘T. Kasper; in Afathematics 
Magazine, Volume 53, No. 4 (1980), pp. 195-201. 


Reference [46] makes use of the notions of “differential algebra”, a field in which 
a related but seemingly more difficult problem had been solved earlier: ‘There are 
simple differential equations (y” + xy = 0 is a specific example) whose solutions 
cannot be expressed even in terms of indefinite integrals of elementary functions. 
This fact is proved on page 43 of the (60-pagc) book: 


+ [48] An Introduction to Differential Algebra, by 1. Kaplansky 


A few words should also be said in defense of the process of integrating in 
elementary terms, which many mathematicians look upon as an art (unlike differ- 
entiation, which is merely a skill). You are probably already aware that the process 
of integration can be expedited by tables of indefinite integrals. ‘There are several 
books containing extensive tables of integrals (and also tables of series and prod- 
ucts), but for most integrations it suffices to consult one of the fairly extensive tables 
of indefinite integrals that are available on-line, for example, at sosmath. com, and 
at wikipedia.org, with its ever-expanding source of generally definitive entries for 
mathematics and physics. 


The remaining references are of a somewhat diflerent sort. “They fall mto three 
categories, of which the first is historical. 


For the history of calculus itself, a excellent comprehensive source, filled with 
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detailed explicit examples, rather than generalized descriptions, is 
[49] The Historical Development of Calculus, by C. H. Edwards, Jr. 


Some historical remarks, and an attempt to incorporate them into the teaching of 
calculus, will be found in 


[50] The Calculus: A Genetic Approach, by O. Toeplitz. 
An admirable textbook on the history of mathematics in general is 
[51] An Introduction to the History of Mathematics, by H. Eves. 


As might be inferred from the quotation on page 39, the basic idea for constructing 
the real numbers is derived from Dedekind, whose contributions can be found in 


[52] Lssays on the Theory of Numbers, by R. Dedekind. 


‘he most important notions of set theory, especially the proper treatment of infinite 
numbers, were first introduced by Cantor, whose work is reproduced in 


[53] Contributions to the Founding of the Theory of Transfinite Numbers, by G. Cantor. 
The letter of H. A. Schwarz referred to in Problem 11-69 will be found in 
1 [54] Ways of Thought of Great Mathematicians, by H. Meschkowski. 


Finally, a great deal of interesting historical material may also be found on-line at 
the site www-groups.dcs.st-and.ac.uk/~history/ 


The second category in this final group of books might be described as “pop- 
ularizations.” ‘There are a surprisingly large number of first-rate ones by real 
mathematicians: 


[55] What is Mathematics?, by R. Courant and H. Robbins. 

[56] Geometry and the Imagination, by D. Hilbert and 8. Cohn-Vossen. 

[57] The Enjoyment of Mathematics, by H. Rademacher and O. ‘Toeplitz. 
+ [58] Famous Problems of Mathematics, by H. Tietze; Graylock Press, 1965. 


One of the most renowned “popularizations” is especially concerned with the 
teaching of mathematics: 


[59] Elementary Mathematics from an Advanced Standpoint, by F. Klem (vol. 1: Arth- 
metic, Algebra, Analysis; vol. 2: Geometry); Dover, 1948. 


Volume | contains a proof of the transcendence of 2 which, although not so ele- 
mentary as the one in [4], is a direct analogue of the proof that e is transcendental, 
replacing integrals with complex line mtegrals. It can be read as soon as the basic 
facts about complex analysis are known. 


The third category is the very opposite extreme— original papers. ‘The dih- 
culties encountered here are formidable, and I have only had the courage to list 
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one such paper, the source of the quotation for Part LV. It is not even in English, 
although you do have a choice of foreign languages. ‘The article in the original 
French is in 


[60] O8uvres Completes d’Abel. 


It first appeared in a German translation in the Journal fiir die reine und angewandte 
Mathematk, Volume 1, 1826. ‘To compound the difficulties, these references will 
usually be available only in university libraries. Yet the study of this paper will 
probably be as valuable as any other reading mentioned here. The reason is 
suggested by a remark of Abel himself, who attributed his profound knowledge of 
mathematics to the fact that he read the masters, rather than the pupils. 


ANSWERS 
TO SELECTED 
PROBLEMS 


CHAPTER 1 


10. 


ite 


12. 
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(i) 
(iii) 


(v1) 


la =a hax) = (a !a)x =} -x =x. 


latin 
D. 2) ip) 2) 5 

lit? Soyo) the 0" me > > — Oc — y) (x + y); so either. — y = or 

x + y =0, that 1s, either x = —y orx = y. 


Replace y by —y in (iv). 


One step requires dividing by x — y = 0. 


oO 
oA rh il : 
+ x 

— ae) 
—_e 4 . 


—e 
SS 


a/b = ab = (ac)(b7'c7!) = (ac)(be)7! (by Gn)) = ac/be. 

(ad + be)/(bd) = (ad + be)(bd)“! = (ad + bc)(b-'d-') (by (iii) = 
ab + cd "=ayb+ c/a. 

ab(a~'b~') = (a ee b-'!) =1,soa7!-b7! = (ab)“!. 
(a/b)/(c/d) = (a/b)(c/d)7' = (a-b7')(e-d7!)“! = (a-b')(c7! - d) = 
ad(b7! ean) = ad(bc)~! = (ad) /(bc). 

x <= —|, 

Or = 7. 

All x, since x7 —2x +2=(¢—-1)° +1. 

x > 3orx <—2, since 3 and —2 are the roots of x7 —x —6=0. 

YS oT —) <x < 3. 

ee. 


ipeOee xe < be 


eR fo a SS 
(jr » 
J A 

_ 

iS 

(= 

<— 


nen 

Ce eS ST 

BA ea = 
ba rd 


_ 
(= 
—e 

SS 


ol + oe : 
$ 7 
es 


_ 
SS 


(vii) 


b—a and d—c are in P, so (b—a) + (d —c) = (b+ d) — (a +c) Is 
in P. Thus,b+d>a-+te. 

Using (i), —e < —d; then (1) imphes that a+ (—e) <b (—4). 

(b—a) and —c are in P, so —c(b—a) = ac—bc is in P, that 1s, ac > be. 
Using (iv), a > 0 and a < 1, so a? <a. 

Substitute a for c and b for d in (vin). 

Veen 3 — V5 V7. 

(ae OV \c| a -4- cl. 

J2+ V3 4 V5 — V7. 

aifa>-—bandb=0: 

—aifa <—band b <0; 

a+2b if a> —b and b < 0; 

—a—2b if a < —bandb=>0. 

ee at x = 0: 

Sy; 2%2 if x= 0. 


c= Fi. =>. 

—6 <x < -2. 

No x (the distance from x to | plus the distance from x to —1 is at 
least 2). 

he |e 

(ey? = Cy) 257 = Py? = (el [x7 since |p| and || oy 
are both > 0, this proves that |xy| = |x| - |y|- 

be /[yl = be Lyi! = bel Ly (by Gi) = bey 4) by @) = e/a. 

It follows from (av) that |x| = |y — ( —x)| < |y| + |y —~x|, so |x| — ly] < 
|x — yl. 

Ix ty +z] < lx t+yl4 lel < lx] +]y] 4 cl. If equality holds, then 
Ix + y| = |x| +|y|, so x and y have the same sign. Moreover. < must 
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have the same sign as x + y, so x, y, and z must all have the same sign 
(unless one is Q). 


CHAPTER 2 1. (i) Since 1? = 1-(2)-(2-1+41)/6, the formula is true for n = 1. Suppose 
that the formula is true for k. Then 
_ K(k + WI(2k + 1) 


Pete +k+ P= SST 4 +1 
k+1 
= kk +1) + 6K +] 


_ kt) 
ae: 


[(k + 2)(2k + 3)] 
_ + tk + HO[+ N+ D 
= ; : 
so the formula is true for k + 1. 
we = 1) = 'T-35 B ect n= 1) 


i=l 


=14+2+4+3+4---+2n—2(1+4+---+n) 
. (2n)(2n + 1) 


5 =n 1) 
=n’, 
5. (a) Since 
2 
l+r= fk 
l-r 


the formula is true for n = 1. Suppose that 


J — rrr! 
i poeueeetes 
l-r 
Then 
} — ttl 
ltrtetr'tr a= l—r De ae 
Z portly pretty —r) 
- l-r 
] — prt 
a 
(b) 
Saber tees tr" 
rS.= Pees ppt pe, 
‘Thus 


SQ) —+) =S-—rS=1—r7™, 
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SO 


i a) 
Ss = ———— 
l-r 
6. (i) From 
GOH 2k = 4 = 6k 4 4k = I, k=1,..., n 


we obtain 


Ge Pegler es 
kl ll oI 


SO 


1)(2 l 
: Gs siya. ig ED 2 se 
ye = 6 2 

4 
ell 


lea’ taal 
oes ie 
(i) From 
| 1 | 


- -_——- lee n 


k k+l] k(k+1) 
we obtain 
l : | 

n+l 2, k(k +1) 

8. | is either even or odd, in fact it is odd. Suppose v7 is either even or odd; 
then 7 can be written either as 2k or 2k + 1. In the first case n + 1 = 2k +1 
is odd; in the second case n + 1 = 2k + 1+ 1 = 2(k + 1) is even. In either 
case, n + 1 is either even or odd. (Admittedly, this looks fishy, but it is really 
correct.) 

9. Let B be the set of all natural numbers / such that m9 — 1 +/ is in A. Then 
lis in B, and/ +1 is nm B if / is in B, so B contains all natural numbers, 
which means that A contains all natural numbers > no. 

12. (a) Yes, for if a+b were rational, then b = (a+b) —a would be rational. If 
a and Bb are irrational, then a + b could be rational, for b could be r — a 
for some rational number r. 

(b) If a = 0, then ab is rational. But if a 4 0, then ab could not be rational, 
for then b = (ab). a7! 

(c) Yes; for example, V2. 

(d) Yes; for example, J/2 and —V2. 

13) (af Since 


would be rational. 


(3n + jas a PCG = Sige 2n)+ 1, 

(3n + Oy = 9n? + l2n+4= 3(3n? +4n+1)4+1, 
it follows that if k* is divisible by 3, then & must also be divisible by 3. 
Now suppose that V3 were rational, and let V3 = p/q where p and 
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CHAPTER 3 


N 


(b) 


q have no common factor. Then p? = 3q?, so p? is divisible by 3, so 
p must be. Thus, p = 3p’ for some natural number p’, and conse- 
quently (3p’)? = 3q2, or 3(p’)? = q?. Thus, q is also divisible by 3, a 
contradiction. 

The same proofs work for V5 and V6, because the equations 


(5n + 1)? = on + [On = 5(5n7 On) lh, 

(51 + Dy Pape Nin el = SiGe 4n) + 4, 

(Sn + OMe = 25n? + 30n + 9 = 5(5n? + 6n + 1) +4, 
(51 + 4)? = 25n* + 40n + 16 = 5(5n2 + 8n + 3)+ 1, 


and the corresponding equations for numbers of the form 6n + 1m, show 
that if k? is divisible by 5 or 6, then & must be. The proof fails for /4, 
because (4n + 2)? is divisible by 4. (For precisely this reason this proof 
cannot be used to show that in general Ja is irrational if a is not a 
perfect square —we have no guarantee that (an + m)2 might not be a 
multiple of a for some m < a. Actually, this assertion 7s true, but the 
proof requires the information in Problem 17.) 
Since 

(2n + ie = $n? + 12n? +6n4+1 = 2(4n3 Gn ck 3n) + I, 
it follows that if k? is even, then k is even. If V2 = p/q where p and 
g have no common factors, then p? = 2q3, so p? is divisible by 2, so p 
must be. Thus, p = 2p’ for some natural number p’, and consequently 
(2p')> = 2q3, or 4(p')? = q3. Thus, g is also even, a contradiction. 


a al . . . . . 
The proof for V3 is similar, using the equations 


(3n + Jeg = 27n? + 27n* 4 On + 1 = 3(9n? 4 On? + 3n) +1, 
9 


Gn 4 2)? = 27 


I 
ii 


+ 54n? + 36n +8 = 309n? + 1802 + 120 + 2) 4-2. 


If w= bithen (lh) 4h)= Ili wh. Supposethat (PF +A)” = l-pen. Then 


(| yer! =(1+A0+h)">04+h0 +nh), since 1+h >0 


kor 


=l4+(n+)hAtnh? >lt(nt+)h. 


h > 0, the inequality follows directly from the binomial theorem, since 


all the other terms appearing in the expansion of (1 + /)" are positive. 


(i 


(x +1)/(x +2); the expression f(f(«)) makes sense only when x 4 —1 
and x 4 —2. 

| fered tou A =] /c if 40), 

(x+y4+2)/a+ 1+ D (for x, y 4-1). 

Only ce iesince f(x) = f (cx) intphes thatte— ex. anaetismausts pe 
true for at least one x 4 0. 

y > 0 and rational, or y > 1. 

0. 

—1, 0,1. 


id. 


Answers to Selected Problems 623 


(i) fgse =< ope 

(ii) {xix 41 and x ¥ 2}. 
(v) @. 

@) 22% 
(ini) 225 F 4 sin (2%). 
(i) Pos. 
( 
\ 


Ht), S10. 
(v) Po PP 
(vil) sososoPoPoPos, 
(a) y. 
(b) H(y) 
(c) A(y) 
(a) 
even odd 
odd neither odd 
(b) 
even odd 
odd 
(c) 
f even f odd 
g even 
g odd even 


(d) Let g(x) = f(x) for x > 0 and define g arbitrarily for x < 0. 

(a) Let g(x) = A(x) = | and let f be a function for which f(2) 4 f(1) + 
fC). Then fo(g +h) 4 fogt+foh. 

(b) [(g+h)o fl) = (g+h)(f)) = g( f(x) +h(f()) = (g 0 f(x) + (he 
fy(x) = [ge +(ho f)]@). 


. l l I ( | ) 
eh y= ——_ —_—_ = — (9(x)):—= | 0 (xX). 
fog’ GEG oF” ie 
(d) Let g(x) = 2 and let f be a function for which fG) #1/f(2). Then 


I/(fog) # fo C/g). 
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vA 
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, « (Soe 
Sipe JS — 
° SS —s 
= 


_ 
et 


eee SS oss ee Se SE 
-“ “ ws ee ee - : : : 
+ ~ Boe 
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—s Sees Tae 
—- 


(iii) 
(iv) 
(a) 


(4): 

(a—€,ate). 

(—2 2). 

(=00,1] U[1, 0). 

All points below the graph of f(x) = x. 

All points below the graph of f(x) = x7. 

All points between the graphs of f(x) =x+ 1 and f(v)=+—- 1. 

A collection of straight lines parallel to the graph of f(«) = —., inter- 

secting the horizontal axis at the points (7, 0) for mtegers 1, 

All points inside the circle of radius | and around (1, 2). 

A square with vertices (1,0), (0, 1), (—1, 0), and (0, —1). 

The union of the graph of f(x) =x and of f(x) =2—--x. 

siitespoint (OF 6): 

The circle of radius V5 around (1.0), singe v4 =2 te ne = (abs + 

yr, 
Simply observe that the graph of f(x) =m(x —a) +b = mx + (b—ma) 
is a straight line with slope m, which goes through the point (a, b). (The 
important point about this exercise ts simply to remember the point slope 
form.) 
The straight line through (a,b) and (c,d) has slope (d — b)/(c¢ — a), so 
the equation follows from part (a). 
When m =m’ and b 4 b’. In that case, there 1s clearly no number x with 
f(x) = g(x), while such a number x always exists if mm 4 m’, namely, 
x = (b' —b)/(m —m’). 
If B = 0 and A ¥ O, then the set is the vertical straight line formed 
by all points (x, y) with x = —C/A. If B 4 O, the set is the graph of 
f(x) = (—A/B)x +(-—C/A). 
The points (x, y) on the vertical line with x = a are precisely the ones 
which satisfy 1.x +0-y+(—a) =0. The poimts (x, y) on the graph of 
f(*) = mx -+D are precisely the ones which satisfy (—m)x+1-y+(—b) = 


The graph of f is symmetric with respect to the vertical axis. 
The graph of f is symmetric with respect to the origin. Equivalently, 
the part of the graph to the left of the vertical axis is obtained by re- 
flecting first through the vertical axis, and then through the horizontal 
aXIs, 
The graph of f les above or on the horizontal axis. 
The graph of f repeats the part between O and a over and over. 

‘The square of the distance from (xx). to (0, i) is 


Bitz. !\ _ one 
TN on A =x" +X aie 
ie l 
=) ae 
—* anc 


\| 
oR 
i) 
ab 
fey 
— 


CHAPTER 5 


(vi) 


(iii) 
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ss “ < Z S) = 
which is the square of the distance from (x, x~) to the graph of g. 
The point (x, y) satisfies this condition if and only if 


(x —a)? + (y— BY = (y—-y)*. 


or 


x* — 2ax +a? + y? —2py + p? = y* —2yy+y?, 


or 


a \_, [o2+62-Y? 
)+( 2p — 2y ) 


If B = y, so that P is on the line L, then the solution is the vertical line 


through P. 


; 
. x7 —8 , 
lim Be lim (x? + 2x +4) = 12. 
c—+2 —Z x2 
. = y" : n—| n—2 n—2 n—| 
lim = lim x" 4+ x" “y +--+: xy" + y 
as: X —-y. xy 
= yr! 4 yi] i aera: ynol = ny" !. 
i at+h— Ja i (Jath—Ja)\(Va+h+Ja) 
wy AA = s Fr 2 SH 
h—0 h h—+0 h(VJa+h+JSa) 
; | 
= im ——— 
h0 /a+h+J/a 
l 


FD Va 


ma , : 
1 = 0. For all x we have | cos(x7)| < 1, so |3 — cos(x*)| < 4, and thus 


|f (x) — O| = |x] - [3 — cos(x?)| < 4- |x]. 


So we can take 6 = &/4. 
1 = 100. We have 
HY 100 


x 


l 
x |x| 


The initial stipulation |x — 1| < 1 makes x > 
then have 


| f(x) — 100] < 200 - |x — 1]. 


So we can take 6 = min(1/2, ¢/200). 


1 
2 


| 
= 100-|~—1] = 100- © tr 


, so 1/|x| < 2, so we 
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10. 


1 = 2. ‘The same sort of argument that was used in the text and in 
number (ni) shows that 


| : 
—-—1| <e for 0 < |x —1| < 6; = min(1/2, €/2), 
x 

so that 
l é 3 ; 
—-l||/< 5 for O < |x — 1| < 6; = mm(1/2, €/4). 
x 2 


Similarly, the solution to number (iv) gives a 52 such that 
lx? —1| <e for 0 < |x — 1] < 8, 


and we have a corresponding 43. ‘Then we can take 6 = min(6}, 42). 


(ai) 7=0, Let = e?. 


(1) 


(111) 


Let 


We need | f(x) — 2| < e/2 and |g(x) — 4| < €/2, so we need 


9) ‘y 
af Boe Saal 
0 x—2 in | sin? [| — —-,—]=6. 
| | << min [sin 6 + 4 
We need 
_ (i4l el4P 
(x) —4 —, 
|2(x) | < min 5 5 


so we need 
0 < |x —2| < [min(2, 8e)]? = 6. 


i = lm f(x) and define g(h) = f(a+h). Then for every ¢ > 0 there is 


Xa 


a 6 > O such that, for all x, if O < |x —a| < 6, then | f(x) —/ < e|. Now, if 

O < |h| < 6, then 0 < |(4+.a)—a| < 6,so|f(a +h) —1| < €. This mequality 

can be written |[g() —1| < e. Thus, him g(h) = 1, which can also be written 
ho 


lim f(a+h) =1. The same sort of argument shows that if lim f(ath) =m, 
h-> ho( 


then lim f(x) =m. So either limit exists if the other does, and in this case 


2a ae | 


they are equal. 


(a) 


(b) 


Intuitively, we can get f(x) as close to / as we like if and only if we can 

get f(x) —/ as close to 0 as we like. The formal proof is so trivial that it 

takes a bit of work to make it look hike a proof at all. ‘To be very precise, 

suppose lm f(x) = / and let g(x) = f(x) —1. Then for all e > 0 there 

ort f 

is a 6 > O such that, for all x, if O < |x —a| < 6, then |f(x) -—1| < . 

g(x) —O| < €, so lim g(x) = 0. The 
ieee Ch 

argument in the other direction is similarly unmteresting. 


This last inequality can be written 


Intuitively, making x close to a is the same as making x — a close to 0. 

Formally: Suppose that lm f(x) = J, and let gv) = f(x — a). Then 
bmn 

for all ¢ > O theré is:a 6 > O such thal, Tor:allia. a O < [x — al) <"d: 

then | f(x) —1| < e. Now, if 0 < |y| < 6, then O < [(v +4) —a] < 6, so 


We 


25 


50: 
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| f(y +a) —1| < e. But this last inequality can be written |g(v) —/| < é. 
So lm g(y) =/. The argument in the reverse direction 1s similar. 
y>0 


Intuitively, x is close to 0 if and only if x? is. Formally: Let him re) 
os 


[, bor every ¢ > 0 there isiam > 0 sich thar if 10) = Ix] <0 (Welt 
[f(x) —1| < e. Then if 0 < |x| < min(1, 8), we have 0 < |x?] < 8, so 
if (°) — t= 2. Thus, lim f (x) = 1, On the other hand, if we assume 


that lim fio) exists, say hm ese = m, then for all e > O there is a 6 
Boe x0 


such that if O < |x| < 6, then fr) —m|<e. Thenif 0 < |x| < 6, 
we have 0 < |2/x| < 8, so [acl e/g?) —m| <e, or |f(x)—m| < e. Thus 
ie Jie 


Wate! Coy li torx =O. and 7) =—1 forx 0) Thea hm ff) wie 


but lim f(x) does not exist. 
) 


The function f(x) = 1/x cannot approach a hmuit at 0, since it becomes 
arbitrarily large near 0. In fact, no matter what 6 > O may be, there 
is some x satisfymg 0 < |x| < 6, but I/x > |/| +¢, namely, any x < 
min(6, 1/(|/| + €)). Any such x does not satisfy |(1/x) —1)| < e. 

No matter what 6 > 0 may be, there is some x satisfying 0 < |x—1| <6, 
but 1/(a — 1) > |/| +e, namely, any x < min(l + 6.1 + 1/(/| + €)). 
Such an x does not satisfy |1/(x — 1) —/| < e. (It 1s also possible to apply 
Problem 10(b): hm Wie bes 1/(x — 1) if the latter exists, so this limit 


does not exist, because of part (a).) 
This is the usual definition, simply calling the numbers 6 and e€, instead 
of € and 6. 
This is a minor modification of (1): if the condition is true for all 6 > 0, 
then it applies to 6/2, so there is an € > O such that if 0 < |x ~a| <e, 
them |7 @)—/| 50/2 <3. 
This is a similar modification: apply it to 6/5 to obtain (1). 
This is also a modification: it says the same thing as (1), smce €/10 > 0, 
and it is only the existence of some € > O that is in question. 


lie him (Ga = lant f (x)= |, then for every ¢ = Outhere are djydan> Opsuch 


Acted io Cla 


that, for all x, 


ier ve Oye them | f @)— 2) = =, 
i a—09 < % <a, them |/(x)—s)| = e. 


Let 6 = min(6;, 52). If 0 < |x —a| < 4, then either a — 6) <a—8 <x <a 
or elsea <x <a+6<a+4), so |lf(x)—-—l1| <e. 


(1) 


ie iim, f(x), then for all ¢ > 0 there is a 6 > Osuch that | f(x)—l| < 


€ for 0 <x <6. If —5 < x < 0, then 0 < —x < 6,s0 | f(—x) —1| <e. 
Thus hm f(—x) = /. Similarly, if lum f(x) exists, then lim f(x) 

x—0- x0 x—O0t 
exists and has the same value. (Intuitively, x 1s close to 0 and positive if 
and only if —x is close to 0 and negative.) 
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CHAPTER 7 


to 


10: 


(111) 


i ho f(x), then for alle > O there is a 6 > O such that | f(x)—l| < 
x— Ot 

Etor Ox <— 6. OOM Oe [yl = 6 then Us|) dee. iis 
hm f(x|) =1. The reverse direction is similar. (Intuitively, if x is close 
dye 

to 0, then |x| is close to 0 and positive.) 

lf i= bm. f(x), then for alle > O there isa é > O such that | f(x)—l| < 
efor 0 <x <6. If 0 < |x| < V3, then 0 < x? < 8,50 WG rl ae. 
‘Thus lim f(x?) =1. The reverse direction is similar. (Intuitively, if x 


. PS em: 
is close to O, then x~ is close to 0 and positive.) 


If? = lim f(x), then for every ¢ > 0 there is some N such that | f(x)—/| < € 


tm 1,8) 


for x > N, and we can clearly assume that N > 0. Now, if 0 < x < I/N, 
then 1/x > N, so |f(1/x) -—1| < e. Thus hm 7 (hx) = “The reverse 


direction is similar. 


(i) 
(iii) 


E(x) — se +82 for all 
(ews) Lor all x. 


Bounded above and below; minimum value 0; no maximum value. 
Bounded below but not above: minimum value 0. 

Bounded above and below. If a < —1/2, then a < —a— 1, so f(x) = 
a+2 for all x in (~a—1.a+1), so a+2 is the maximum and mimimum 
value. If —1/2 < a < 0, then f has the minimum value a’, and if 
a > 0, then f has the mimimum value 0. Since a +2 > (a + 1)? only 
for (-1 _ V5) /2 <a<(-l+ V5 )/2, when a > —1/2 the function f 
has a maximum value only for a < (—1 + J5 )/2 (the maximum value 
being a + 2). 

Bounded above and below; maximum value |; minimum value 0. 
Bounded above and below; maxiunum value 1; minimum value —1. 

f has a maximum and minimum value, since f 1s continuous. 
n=—2, since f(—2) < 0 < f(-1). 

n=-—l,since f(—l1) =—-1 <0 < f(O). 

if (Cee 1687 ei? sin?.x) — 119, then f is continuous on 
Rand f(2) > 0, while f(—2) < 0, so f(x) = 0 for some x in (—2, 2). 


f is constant, for if f took on two different values, then f would take on all 


values in between, which would include irrational values. 


1) 
2) 
3) 


( 
( 
( 
(4) 


[oS 
fQ@) = —%;5 
f(x) = [a]; 
ff) = —|xI. 


Apply ‘Theorem | to f — g. 


CHAPTER 8 


le 


i) 


10. 


13h, 
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it 7) = 0 or /@) = de chasse x = 0 or deelian@®) > 0 = 1(0) and 
fC) < 1=/(), then Problem 10 applied to f and / implies that f(x) = x 


for some x. 


(i) 1 is the greatest element, and the greatest lower bound is 0, which is 
not in the set. 

(in) 1 is the greatest element, and 0 is the least element. 

(vy) Since {x :x?+x+1 > 0} =R, there is no least upper bound or greatest 
lower bound. 

(iy poinge (ee x =< Orand x° a he) = ee J5 4/2. 0), the greatest 
lower bound is [—1 — /5]/2, and the least upper bound is 0; neither 
belongs to the set. 

(a) Since A # Y, there is some x in A. Then —x is in —A, so —A F VF. 

Since A ts bounded below, there is some y such that y < x for all x in 
A. Then —y > — x for all x in A, so —y > z for all z in —A, so —A is 
bounded above. Let @ = sup(—A). Then @ is an upper bound for —A, 
so, reversing the argument just given, —@ is a lower bound for A. 

Moreover, if 6 is any lower bound for A, then —£ is an upper bound for 
—A,so —B = a, so"B = —a. Thus =e is the’greatest lower bound for A. 

(a) If 7 is the largest integer with / < x, then/+1>x,but/+1<x+4+1 <y. 
So we can let k = / + 1. (Proof that a largest such integer 7 exists: 
Since N is not bounded above, there is some natural number n with 
—n <x <n. There are consequently only a finite number of integers / 
with —n </ < x. Pick the largest.) 

(b) Since y — x > O, there is some natural number nv with I/n < y — x. 
Since ny — nx > I, there is, by part (a), an integer k with nx <k < ny, 
which means that x <k/n < y. 

(c) Choose r + Xs — ry 2 

(d) By part (b), there is a rational number r with x < r < y, and therefore 
a rational number s with x <r <s < y. Apply part (c) tor <s. 

Let k be the largest integer < x/q@ (the solution to Problem 5 shows that such 

a k exists), and let x’ =x —ka > 0. If x —-ka=x'>a, thenx > (K+ le, 

so k +1 < x/a, contradicting the choice of k. So0 < x’ <a. 

(a) Since any y in B satisfies y > x for all x m A, any y in B is an upper 
bound for A, so y => sup A. 

(b) Part (a) shows that sup A is a lower bound for B, so sup A < inf B. 

Since x < sup A and y < sup B for every x mn A, and y in B, it follows that 

x+y <sup A-+sup B. Thus, sup A + sup B is an upper bound for A + B, so 

sup(A + B) < sup A+sup B. If x and y are chosen in A and B, respectively, 
so that sup A—x < €/2 and sup B—y < e/2, then sup A+sup B—(x+y) < «. 

Hence, 


Sepian >) = At py spe 1 Stl = ie. 
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CHAPTER 9 fart fay 


I I 


fla+h)— fla) laa Gh “a 
h h—-+0 h 
—|] | 


rae a(a +h) a’ 


Po) = fi 


(b) The tangent line through (a, 1/a) is the graph of 


ee | 1 
g(x) = —@ - a) +— 
a a 
— gz a 
If f(x) = g(x), then 
] 2 2 
—-=-—s+- 
x a~ a 
or 
x? —2at +a* =0, 
SO. X =a: 
l | 
pe GT — Fie). 1. ane = a 
742) = hm h a 


lim (—2ah — h?) 2 

=u TTT ES On. 

h>0 ha?(a +h)? aa 

The tangent line through (a, 1/a?) is the graph of 


2 | 
g(x) = —(* —a) + 
a- 


a2 

eS 

=— a Poly 
If f(x) = g(x), then 

ih eee 8 

x) 6G 
or 

2x3 — 3ax? +a? = 0, 
or 
O=(x - a)(2x? —ax — a’) = (x —a)(2x +a)(x —a). 

So x =a or x = —a/2; the point (—a/2, 4/a’) lies on the opposite side 


of the vertical axis from (a, 1/a?). 
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é) 
zs ... fla+h) — f(a) 5 Vath —. fa 
f(a) = bn AANA —K— = Lim —_\—— 
: h—0 h h—0 h 
= lim SST NESTE Ae) SHR AN OE BE AA) = lim pak. 
h>0 h(Ja+h+ Ja) h>0h(/a+h+ Ja) 
] 


4. Conjecture: S,/(x) = nx"~!. Proof: 


Sn(x +h) — S,(x) (x +h)" — x" 
———————— . 7 


S,'(x) = lim lin 

h—( h h—O h 
n 
yD (Qi — x" 
J 

‘ j= 

= lim 
h—O h 


n 


‘ n Et Sy tee 
= lim ) Eames ae 
h—0 * J 
j=l 


n & fe : : _- ‘ 
= (7). Tnx") since lim h/~' = 0 for j > 1. 


I~ 
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eA “(x) = 0 for x not an integer, and f’(x) is not defined if x is an integer. 
gs S' 


6.» » fa) 
x +h) — g(x x+/] Ts x ; 
g(x) = lim en ae = lim [Ger Ase aL = Lite se] aL = Lee] 
h—-0 h h—+0 h 
if x } r2e x 
= lim SQ +h) — FO) os F &): 
h—0 h 
(b) 
. g(x +h) — g(x) . cef(x+h)—cf (x) 
2 (x) = hm AA AAAAK—K = lim oO x jj—-— 
h—0 h i) h 
g x h 5 ap x 
a) any lim FQ +h) —FIR) — ameer 
h—-0 h 
7. (a) f'(9) =3-97; f’(25) =3- (25); f'(36) = 3 - (36)?. 


9 


(b) f’(37) = f’(9) =3-97: f'(5*) = f’(25) =3- (25)*; f'(62) = f’(36) = 


3 - (36). 
(c) f’(a*) = 3(a*)? = 3a4; f'(x?) = 3(x2)? = 3x74. 
(d) f'(x*) = 3x4; but g(x) = x®, so g/(x) = 6x°. 
8. (a) 


g(a +h) — g(x) _ es fe FAH f& Hc) 
h , h20 h 

. f([x+e] +h) — f(x+ec) 

= lin —- ——— - —_ 
h—0 h 


g(x) = lim 
h-0 


= f'(x+c). 


632 


Answers to Selected Problems 


CHAPTER 10 


30. 


(b) 
ow es) nee f(ex + ch) — flex) 
h a0 h 
CU Remmi CH) Se yells Wont) (cx 


ee) A bion 
h—0 


= hm ln 
h>0 ch k=0 k 
x +k) — flex 
=e len flex +) — flex) =c- f(x). 
k—0 k : 


(Compare the manipulations in this calculation with Problem 5-14.) 

(c) If gx) = f@ + @pthen gx) = f(x +4), by part {a). But g = f, so 
f'(&) = g'(x) = f(x +a) for all x, which means that f’ is periodic, with 
period a. 

G) If g(x) = x°, then g’(x) = 5x4. Now f(x) = g(x + 3), so by Prob- 

lem 8(a), f’(x) = g’(x +3) = 5(x +3)". And f’ +3) = 5x + 6)4. 
(meer) = (1 —3P"sorf Gey = 5 — eas part (i). And f’(x+3) = 5x74. 
any fe) = (a + 2) 'S.souf Co = 7 4 2)" as in part @). And f’(x +3) = 
Teast 5)°. 

If f(&) = g(t+x), then f’(x) = g9'(f++), by Problem 8(a). If f(t) = g++), 

then f’(t) = g'(t + x), by Problem 8(a), so f’(x) = g’(2x). 

(lips) — ct?, then s’(t) = 2ct, and there is no number & such that 
SG) = est) [that ise 2em— her" forall 7, 

(By the way, at this point we do not know any nonzero function f for 
which /’ is proportional to f. After Chapter 18 it might be amusing to 
determine what the world would be like if Galileo were correct.) 

(b) . @) If s(t) = (a/2)t?, then s’(t) = at, so s"(t) =a. 

(ii) is (4)|- = (at)? =2a- (a/2)t? = ease 

(c) The chandelier falls s(t) = 1617 feet in t seconds, so it falls 400 feet in 
t seconds, if 400 = 1677, or t = 5. After 5 seconds the velocity will 
be s/(5) = 5a = 5-32 = 160 feet per second. The speed was half this 
amountamien 80 = 5 (7) = 327, or = 3. 

(a) ‘This is another way of writing the definition (see Problem 5-9). 

(b) ‘This follows from Problem 5-11, applied to the functions @(h) = 
| Lian sf (a)\ fe and B@) =\(e@ Am) — (ie) me 

(i) f(x) = 6x. 

(iii) f(x) = 4x3. 

(i) means that f’(a) = na"—! if f(x) = x". 

(iil) NIGATIE Uae) = f'(a) if gtx) =e (x) 4+ c. 

(v) means the same as (ii). 

(vii) means that g’(b) = f(b +a) if g@) = f(xt+a). 

fix) “itteahSutliat«p) — «ef (ch) if e(1) = f(ca)s 


(i) (1 + 2x) + cos(x + “Vay, 
(it) (—sin.x) + cos(cos x). 


COSX —*x sin.x — Gosx 
COs . : 


5 
A 


en i, 
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(vii) (cos(x + sinx))- (1 + cosx). 
(i) (cos((x + 1)? @ +2): [24 + D@+24+@+4 1’. 
(in) [2sm((x + sin x)*) cos((x + sin x)*)] -2(x + sin x)(1 + cosx). 
(v) (cos(x sin x)) - (sinx + x cosx) + (cos(sin x2)(cos x2)) - 2x. 
(vil) (2 sin x cos x sin x2 sin? x2) + (2x cos x? sin’ x sin? x2) 

+ (4x sin x? cos x? sin? x sin x). 
(ix) 6(x + sin? x)5(1 +5 sin* x cos x). 
(x1) cos(sin’ a + ie -7(sin! Fs hy acal @/ sin® x7 - cosx! - 7x°). 
(xiii) cos(x? + sin(x? + sin x7)) - [(2x + cos(x2 + sin x7) + (2x + 2x cos x))]. 
(1 + sin x)(2x cos x? - sin? x + sin x2 - 2 sin x cos.x) — cos x sin x? sin? x 

(1 + sin x)? , 


>. 
all, x 3 a 3x2 sin x — x? cos x 
3x“ sin | — — x° cos { = . | “|. 
sIn xX sin Xx sin~ xX 
ys 


( (x + 1): 
(x + 2)? 
(iii) 2x? 
(i)  —x?, 
(iii) 17. 
(Gi) f'@) = 8'@ + g(a). 


Gu) fis) — 2 (ae oO) (1 + g(x): 
(v)  f'() = g@). 

) A(t) = 2rr(t)r'(t). Since r'(t) = 4 for that ¢t with r(t) = 6, it follows 
that A’(t) = 22 -6-4 = 48m when r(t) = 6. 
(b) If V(r) is the volume at time f, then V(t) = Anr(t)?/3, so V(t) = 

Anr(t)2r'(t) = 4 - 62-4 = 576m when r(t) = 6. 
(c) First method: Since A’(t) = 2n¥ G@re),and A’ (y= tor 7) = 3, it 
follows that 
A(t) 


: 5 
Oe De8e) = Gy when r(t) = 3. 


‘Thus 
V(t) = 4nr(t)?r'(t) 


5 
me Gp 
x-9 a 


= 30 whenr(t) =3. 
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‘To apply the second method, we first note that if 


f(t) = Aa? = JA. 


then, using Problem 9-3 and the Chain Rule, 


f= ea 3A(t)°A (t) 


2/ A(t)? 


l AY 


3 , : 
= = A(t)! 2 A'(t) (just as we might have guessed). 


Now 

4nr(t)? — 4ar[r(t)7]3/? 
leas 3 

J A[ar(t)?]>/? 

ae 3771/2 
4A(t)>/? 


3771/2 © 


V(t) = 


So 


Fink, So 
Vit) 3x12 ° By OA (1) 


1/72 a! r(tA'(t) 
a TS 


= 23 3930. 
10. (i) (foh)'(0) = f’(h(O))-h’(0) = Ff’) - sin2(sin 1) = 
[6sin 5 — cos 5] sin*(sin 1). 
(11) a(x?) = h'(x4) - 2x? = sin? (sin(x4 + ]))- 2x2, 
12. The Chain Rule implies that 


l / 
(_) (x) = (f og)'(x) = f'(g(x)) -2'(x) 


| 
= ———~ «g'(x). 
g(x)- : 
lz. @@ dy 
35. (i) = = zs . = = (cos y) - (1 + 2x) = (cos(x + x7)) (1 + 2x). 

iil) sl oe ds (cosu) - (cosx) = (cos(sin x)) - (cos x) 
SSE SS Ee Os ° Sx) = © "(COS X). 

(i Won Heh Ge s cos 

CHAPTER 11 1. (i) O= f’(x) = 3x? — 2x —8 forx =2 and x = —}, both of which are in 

[—2, 2]; 


f(—2) =5, £2) =—I1, f(—4) = 3; 


° 203 * 
maximum = 45>, minimum = —I1. 


N 
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(iit) ORS pea) 4? + 1x = 12e(x2 — 2x 4 1) for 2 = 0 and 
x = 1, of which only 0 is in [-5. 5]; 
f(-3) = Te FQ) = 1 FO) = 0; 
maximum = 8 nunimum = 0. 
(Vv) O= f(x) = 
x*+1—(@41)2x _1-2x-x? 
(x2 + 1)? (x2 41)? 
for x = -14 V2 and x = —1— x oy of which only —1 + V2 is in 
= or 
f(-) =0, f() =$% fe-l4+ v2) = 4 V2)/2; 


maximum = (1 + /2)/2, mimimum = 0. 


(1) —3 is a local maximum point, and 2 is a local mmimum point. 

Qi) Quis a local mmimum poimt, and there are no local maximum points. 

(vy) —-I+ V2 is a local maximum pomtand — ie — V2 is a local minimum 
pomt. 


(a) Notice that f actually has a minimum value, since f ts a polynomial 
function of even degree. The minimum occurs at a point x with 


n 
Oat (4) — 2 ie — aj), 
i=l 
so X = (ay +---+a,)/n. 

(i) 3 and 7 are local maximum points, and 5 and 9 are local mmimum 
points. 

(it) All irrational x > O are local minimum ports. and all trrational x < 0 
are local maximum poimts. 

(v) x isalocal minimum point if its decimal expansion does not contain a 5, 
It is a local maximum point if its decimal expansion contains exactly 
one 5 that is followed by an infinite string of 9’s. In all other cases, x is 
both a local maximum point and a local minimum point. 

If f(x) is the total length of the path, then 


ie — Vx? + ad t/a 


The positive function f clearly has a minimum, since lim f(x) = hm f(x) 
Na OS, eters ne 
= oo, and f is differentiable everywhere, so the mimmum occurs at a point 


x with f’(x) = 0. Now, f(x) = 0 when 
]-. 
A — i) 


x 
Vx2- +a? Glee = 


This equation says that cos@ = cos p. 

It is also possible to notice that f(x) 1s equal to the sum of the lengths of the 
dashed line segment and the Ime segment from (x, 0) to (1, b). This is short- 
est when the two line segments he along a Ime (because of Problem 4-9(b), if 
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10. 


LH. 


21 
28. 


a rigorous reason is required); a little plane geometry shows that this happens 
when @ = B. 


If x is the length of one side of a rectangle of perimeter P, then the length 
of the other side is (P — 2x)/2, so the area is 


x(P — 2x) 
a 


So the rectangle with greatest area occurs when x is the maximum point for f 
on (0, P/2). Since A is continuous on [0, P/2], and A(O) = A(P/2) = 0, 
and A(x) > 0 for x in (0, P/2), the maximum exists. Since A is differentiable 
on (0, P/2), the minimum point x satisfies 


P —2x 


A(x) = 


0=A'(x) = 


P-—4x 
ma 


so x = P/4. 
Let S(r) be the surface area of the right circular cylinder of volume V with 
radius r. Since 


V=ar-h where h is the height, 
we have h = V/zr*, so 
S(r)= Qnr? +2nrh 
= 2nr? + ai 


We want the minimum point of S on (0, 00); this exists, since lim Str) = 
'- 


lim S(r) = oo. Since S is differentiable on (0, 00), the minimum point r 
r—>Co 


satisfies 
2V 
0 = S'(r) =4ar — = 
Py 
a 4nrr? —2V 
— 2 : 
or 
oper 
"ON Biggs: 


1 is a local maximum point, and 3 is a local minimum point. 
(a) We have 


f(b) — fla) 


= f(x) for some x in (a, b) 
b—a 


> M, 


so f(b) — f(a) = M(b—a). 


ie) 
to 


(I>) 
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We have 


f(b) — f(a) 


=u fa(os) mm forsomicen in ag) 
DG 


<n, 


so f(0) — f(a) = ih(b —@). 
If | f'(x)| < M for all x in [a,b], then —M < f(x) < M, so 


f(a) -— M(b-a) < f(b) < f(a+M(b—-a), 


or 


| f(b) — f(a)| < M(b— a). 


f(x) = —cosx +a for some number a (because f(x) = — osx is one 
such function, and any two such functions differ by a constant function). 
f'(x) = x4/4+4 for some number a, so f(x) = x°/20+ax +b for some 
numbers a and b. 

f' @) =<? 2 x°¥3 efor some a, sof’) = w/6-+ x /12 4 axe 
for some a and b, so f(x) = x*/24 + x°/60 + ax7/2 + bx +c for some 
numbers a, b, and c. Equivalently, and more simply, f(x) = x+/24 + 
x°/60 + ax* + bx +c for some numbers a, b, and ec. 


Since s"(t) = —32, we have s'(t) = —32t +a for some a@, so s(t) = 
—16t7 + at + B for some a@ and B. 
Clearly, s(0) = 0+0+4+ 86 and s’(0) =0+a. Thus, aw = vg and B = so. 


: > 5 : 
In.tlis case, sg =.0 and.vp.=.v, so s@) = =lofe + zt. The maximum 
valuc ols occurs when 0 = sj = 327 -- ev, or? = v/ 32. Some 
maximum value is 


(33) 


= (belles ao) 


At that moment the velocity is clearly 0, but the acceleration is —32 (as 
at any time). The weight hits the ground at time t > 0 when 


QO = s(t) = —1627 + vt. 


or t = v/16 (it takes as long to fall back down as it took to reach the top). 
The velocity 1s then 


s'(v/16) = —3 


ei — |)! 


N 
aaa 
al 
See 

la 


(the same velocity with which it was mitially moving upward). 
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47. Apply the Mean Value Theorem to f(x) = /x on [64, 66]: 


66 — V64 | 
uae =f (oe) ae for some x in [64, 66]. 


Since 64 < x < 81, we have 8 < /x < 9, so 


| V66—8 | 
0 wmieay Lee 
51. lHopital’s Rule does not lead to the equation 
. 3x24] en OE 
lim = lim — 
t= | Le A=? | 


because lim 3x7 +140. 
> 


52. (i) 
lim —— = lim —— = lim cos*x = 1. 
x0 tan x x0 Sec* xX r0 

(ii) 
9 . 
.. cos*x — | . —2sinx cosx 
im —_— = lim. —___—_ = -1. 
x30 x" x0 De 
CHAPTER 12 l @) f-@) = @-—-—1)!'3. (if y = f-'@), then x = f(y) = y* +1, so 


y=(@ —1)'3)) 
(iii) f-'= f. (If y = f—'(x), then 


y, y rational 
i= (ly) = Be 
—y,  y irrational; 


since +y is rational or irrational if and only if y is, we have y = x if x 


is rational and y = —.x if x is irrational, so y = f(x).) 
(Vv) 
X D de As 1 mere An 
fo) = aj_| C=O" WD. occn n 
Ay ee 
wi) f=, 


tr 


(i) f-! is increasing and f~'(x) is not defined for x < 0. 
(iii) f~! is decreasing and f~'(x) is not defined for x < 0. 
3. Suppose f is increasing. Let a < b. Then f-'(a) 4 f—'(b), since f—! is 
one-one. So either f~'(a) < f~'(b) or f-'(a) > f-'(b). But if f-'(a) > 
f-'(b), then 

b = f(f-'(b)) < f(f'(a)) =a, 


a contradiction. ‘The proof is similar for decreasing f, or one can consider 
—/f instead. 

4. Clearly, f + g is increasing, for if f(a) < f(b) and g(a) < g(b), then 
(f + 2)(a) = fla) + g(a) < f(b) + g(b) = (f + g)(b). 
f +g is not necessarily increasing; for example, if f(x) = g(x) = x. (But f-g 


11. 


Answers to Selected Problems 639 


is increasing if f(x), g(x) > O for all x.) 
f © g is increasing, for if a < b, then g(a) < g(b), so f(g(a)) < f(g(b)). 
(a) If (fog)(x) = (f og)(y), so that f(g(x)) = f(g(y)), then g(x) = g(y), 
since f is one-one, so x = y, since g is one-one. 
(fog)! =g-!o f-!: for if y = (f o g)7'(x), then x = (f og)(y) = 
F(g@)); sox) = f-*@).s0 y= ef 1@)). 
if f(x) = f(y), then 
axTb ay-+b 
cx+d  cy+d’ 


sO 
acxy + bcy + adx + bd = acxy + ady + bex + bd, 
or 
ad(x — y) = bc(x — y). 

If ad F be, this implies that x — y = 0. (But if ad = be, then f(x) = f(y) 
for all x and y in the domain of f/f.) 
If y = f—'(x), then x = f(y), so 

. yD 

~ eyt+d 


so 
—dx +b 
f'@=y= ieee for x # a/c. 
cx —a 
(i) ‘Those intervals [a,b] which are contained in (—oo,0] or [0,2] or 
[2, 0), since f is increasing on (—oo, 0] and [2, 00), and decreasing 
on [0, 2]. 
(iii) “hose intervals [a,b] which are contained in (—oe, 0] or [0, 00), since 
f is increasing on (—oo, 0] and decreasing on [0, 00). 
Since 
(fF 'Y@) 
= SS 
AW a 3) 
we have 
ee =f 2G) 
(yx) = SE ES a Am 
(FCF AG) |= 
_ —f"(fo1@)) 
ai pasts 69 5) 


The formula for the derivative reads: 


dx _ ] 
ys 
dx 


(In this formula, it is understood that dx/dy means (f~')/(y), while dy/dx 
is an “expression involving x,” and in the final answer x must be replaced 
by y, by means of the equation y = f(+).) 
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21. 


22, 


CHAPTER 13 l. 


The computation in Problem 20, when completed, shows that 


dx ii” l l 
dx = n(x!/nyjn-l “= nxt-(l/n) 
| 
= ax /nj-1 
n 


Gix)=x(f')'@) EF 16) — Ff") “GF -YV@) 
xt FY Cf) — FF 2 ON) 
=x(f)'@) + f7@) —xif 1Y@) 


= Fite): 
(1) 
| | l 
6 YO) =—— = ee 
h'(h-"(3))—W'(O) sin? (sin. 1) 
Fe eas | ee t,} is the partition with 4 = ib/n, then 


n 


L(f, Pra) = > Gear? (Gh — 44) 


i= 


ram ”- on 
n—| 

OS 
~ 

n ar 
a b* | (n —1)4 fr (n —1)3 a (n — 1)? 
on 4 2 4 : 

and similarly 
U(f. nyo 3S fe 
ny = 4 7 | 


Clearly L(f, P,) and U(f, P,,) can be made as close to b4+/4 as desired by 
choosing ” sufficiently large, so U(f, P,) — L(f, Py) can be made as small 
as desired, by choosing n large enough. This shows that f is integrable. 
Moreover, there is only one number a with L(f, P,) < a < U(f, P,) tor 
all n; since Se x3 dx has this property, the proof that fs xedx = b*/4 will 
be complete once we show that L(f, P,) < b4*/4 < U(f, P,) for all n. This 


Answers to Selected Problems 641 


can be done by a straightforward computation, but it actually follows from 
the fact that L(f, P,) and U(f. P,) can be made as close to b+/4 as desired 
by choosing n sufficiently large. In fact, if it were true that b4/4 < ie dx 
then it would not be possible to make U(f, P,) as close as desired to bt /4 by 
choosing n large enough, since each U(f, P,) = fe x3 dx, and sunilarly we 
cannot have b+/4 > ss Eeghe 

We have 


Nr 


Gh Py} = 


no 5) 2 3 30 
: b> |n nt wr n 
UF. Pi) = ee oath a= an , 
n 2 


Clearly Lf?) and U({, P;,) ean’ be macevas close to b>/5 as desired by 
: een. I 

Sate ie n large enough. As in Problem 1, this implies that fp xtdx =b?/5. 

eK) Rf, f = 0: 

o 2 

nee Bef =. 

(v) (f 1s not integrable. 

ee 

Sie 7 ak 
(Kor a rigorous proof that the functions in (1), (ii), and (vil) are integrable, 
see Problem 19. ‘The values of the integrals, which are clear from the 


b° = 1)° af @ — 1) i Co ie 7 (n — 2) 


geometric picture, can also be deduced rigorously by using the ideas in 
the proof of Problem 19, together with known integrals.) 


(¢q, — oe Saal a 
J2/2 3 


ia 2/2 


fw —2x+4)- Re Aca. 
0 


b d b d 
i (/ Fear) dx =") (se f eidy dx (here f(x) is the constant) 


d b 
-| g(y)dy- Fide 


d 
(here / g(y) dy is the constant). 
- 


13.* (a) Clearly L(f, P) = 0 for every partition P. 
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(b) Apply part (a) to f — g, and use the fact that 
b b b 
[u-o=f s- fos. 
23. (a) Clearly 


m(b—a) < L(f. P) < U(f, P) < M(b—a) 


for all partitions P of [a,b]. Consequently, 


b 
m(b—a)< i f(x)dx < M(b—a). 


‘Thus 


satisfies m < pu < M. 
(b) Let m and M be the mimimum and maximum values of f on [a,b]. 
Since f is continuous, it takes on the values m and M, and consequently 
the number sz of part (a). 
33.. (a) 0. 
(b) 5. 
37. Since 
AA St Sif; 


b bh b 
-| vis [ors fl. 


we have 


so 
b b 
| [sl<fin 
b 
(Problem 36 imphes that | f| makes sense.) 
CHAPTER 14 1. (i) (sin? x3) - 3x2. 
*"3 | 
ii [ ea 
8 1+r2+sin°t 
6 l 
(v) / = 7 aS dt. 
a l+t-+sin°t 
| 
=e, F-! Kx) = = F-} x). 
(vu) (FO ")' (x) F(F-"a)) (x) 
2 (i) Allee #1; 
(ii) Allx £1. 
(v) All x. 
(vii) All x 4 0. (F is not differentiable at 0 because F(x) = 0 for x < 0, but 


there are x > 0 arbitrarily close to 0 with fut = LG 
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4. (i) 
| l 
SAS IN (0) SS Se ee SS ES 
(fF -y@) f'(f-(0)) 1 + sin(sin( f-!(0))) 
l 
= l. 


| + sin(sin 0) . 


rot 


) 


8: F(x) =f f(t)dt,s 
0 


F'(x) =xf(x)+ ik f(t)dt. 
0 


x y z l 
f(x) =) i! (/ * i) dy. 
0 ( 0 0 V1+sin7¢ 


13. We can choose 


xl /n)+l 
ixy= 

] 

—+1 

n 
Then 

b pbU/n+ 
/x dx = f(b) — f(0) = i 
0 = a4 
n 
CHAPTER 15 Pe) 
| ] | 
l+arctan?(arctanx) 1l-+arctan?x 1+x2° 
(111) 
| > tan.x 
Ss (ee IR En See =e ve 
| + (tan x arctan x)- l1+.x- 
Za i 
(ii) 0. 
(v) 0. 
dike) 


sin 2x = sin(x +x) =sin.x cosx + cos x sinx = 2sin x cos x. 
2 2 2 2 
cos 2x = cos* x — sin* x = 2cos* x —1= | —2sin*x. 
sin 3x = sin(2x +x) =sin 2x cosx + cos 2x sin x 
« 9 °) oD < 
= 2sin x cos~ x + (cos~ x — sin* x) sin x 
. 2 On| 
= 3sin x cos. x — sin> Xx. 
cos 3x = cos(2x + x) = cos2x cos x — sin 2x sin x 
2 2 5 ed 
= (cos’ x — sin” X) cosx = 2ZsIn° x cosx 
3 ay ae 
= cos x — sin’ x cosx — 2sin* x cos x 
3 2 
= cos’ x — 3sin* x cos x 


3 
= 4cos’ x — 3:cos x. 
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9; 


(b) Since cos 2/4 > 0 and 


(a) 


1 
0 = cos = = cos2- 


we have cos7/4 = J2/2. It follows, since sinz/4 > O and sin? + 
cos? = 1, that sina /4 = /2/2, and consequently tan 2/4 = 1. Sim- 
ilarly, since cos 7/6 > 0 and 


0 = cos — = cos3- 


we have cos 2/6 


V1 —(¥3/2)? = 


V3/2. It follows, since sinz/6 > 0, that sinz/6 = 


rie lI 


sin(x + y) 


tan(x + y SSS = 
( ¥) cos(x + y) 


sin x cos y + cos x sin y 
COs x COS y — SIN X sin y 
sinxcosy  cosxsin y 
cosx cosy  cosx cosy 
cosxcosy sinxsiny 
COS X COS Y ~ €OS.X COS y 
tanx + tan y 

1 —tanxtany 


(b) From part (a) we have 


tan(arctan x) + tan(arctan y) 

tan(arctan:x --:arctan:y) = ——q_e_e_ccwme—_ 

| — tan(arctan x) tan(arctan y) 
x+y 


a 


provided that arctan.x, arctan y, and arctan x + arctan y # kw + 7/2. 
Since —7/2 < arctan x, arctan y < 7/2, this is always the case except 
when arctan x + arctan y = +27/2, which is equivalent to xy = 1. From 
this equation we can conclude that 


arctan x + arctan y = arctan (=) 
—xy 
provided that arctan x +-arctan y lies in (—2/2, 2/2), which is true when- 
ever xy < 1. (If x, y > O and xy > 1, so that arctan x + arctan y > 2/2, 
then we must add z to the right side, and if x, y < 0 and xy > 1, so 
that arctan x + arctan y < —2/2, then we must subtract 77.) 
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11. ‘The first formula is derived by subtracting the second of the following two 
equations from the first: 


cos(m — n)x = cos(mx — nx) = cosmx cos(—nx) — sin mx sin(—nx) 
= cosmx cosnx +sinmx sinnx, 
cos(m + n)x = cosmx cosnx — sinmx sinnx. 


The other formulas are derived similarly. 
12. It follows from Problem 11 that if m 4 n, then 


Ww ] W 
/ sin mx sinnx dx = = [cos(m — n)x — cos(m + n)x] dx 


> G 2 —7 
l sin(m —n)x  sin(m+n)r 
= m—n ‘ m+n 
sin(m —n)x sin(m+n)z 
-| m—n i m+n |} 
0) 


(Note that sin(m — n)(—21) = sin(m — n)z since m — n is an integer.) But if 
m =n, then 


w ] 4 
/ sin mx sinnx dx = 5) / 1 — cos(m + n)x dx 
= | 


1 {| sin(m ren sin(m || 
== {|x — ——— | - | -2 — ————_ 
2 m+n m+n 


= TI. 


The other formulas are proved similarly. 
15. (a) We have 


2 Puy 
cos 2x = cos x — sin“ x 
we OD 
= | —2sin“x 


2 


= 2cos“ x — l. 


So 


1 — cos 2x 
2 

1 + cos 2x 

sien 

(b) These formulas follow from part (a), because cos x/2 > 0 and sin x/2 > 0 
(since 0 < x < 2/2). 


wy 
sn xX = 


4 
cos’ x = 


(c) 


e > 1 —cos2x l l 
/ sin? x dx =| dr 9b — a) — g (sin 2b — sin 2a). 


° > 14 cos2x l | 
[ cotxax= | EE dx = 5(b —a) + z(sin 2b — sin 2a), 
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CHAPTER 18 


19: 


20. 
Pa 


(a) arctan 1 — arctan 0 = 7/4. 


(b) 


X00 x—0* 
(a) 1 wx 1 
(sin’)/(x) = Ts0 °° es) = 180 cos (x). 
(cos’) (xr) = Te + sin (77) = — sin®(x) 
cos’) (x) = 735 -—sin (795) = Teg Sin): 
B) hig sin’ x ats sin(ax/180) Lane a sin(rx/180) ah 78 
0) X x50 x ~ +0180 2x/180 180° 
lim x sin” i = lim ce LS 
X00 x x30 180 
(i) a net. ef e*, 
(iii) (sin.x)""S™ | (log(sin x)) - cos(sin x) « cos x 
+ (cosx/sin x) - sin(sin x)] . 
; . sin x 5 £ A F . COS X 
(v) (sinx) 82) in y* fogs.) cos logis x) + sina 7 
sin x 
; GOSH 
+ (sinx)""* - =. 
sin x 
re . x log(sin e") . x (cos e* )e* 
(vil) [aresin ( ae )| (log (aresin 1) era a 
a - sinx — x cos.x 
+ log(sin e*) - ——oo¢5555sSS—. 
ee 2 (ee 
arcsin () l (= =| sin~ Xx 
(ix) (lo x)!98* . | loo (lo Preise x- a 
8° Bet) S logx x] 
(xi) cos(x" 9) sine") cos(xs"*) Sine , log x 
| “I ad 
Cost 6g 
Xx x 
i) 0. 
(iii). 
(v) 3 


lim arctan x — arctan 0 = 7/2. 


xX OO 


F agi i hes. 
lim xsin — = lim —sinx = 1. 
x x 


Answers to Selected Problems 647 


8. (a) 


2 bern 
cosh* x — sinh* x = 


(c) 


inh x cosh y + cosh x sinl ag el ) al eh a (2 -¢7 ) 
sinh x cosn vy cosh x smn y => | —_— — —___—— 
; 2 2 2 2 


a erty ey eutty ie ee) y erty ex ¢ eo*ty ey 
74 4+ + 4 4 4 4 4 


exty = ety) 
= ——___———_ = sinh(x+ y). 


2 
(e) Since 
J e* + Qa 
sinh x = 
we have 
ip * e* —e-* 
sinh (x) = == cosh x. 
(g) Since 
sinh x 
tanhx= : 
cosh x 


we have 


? ; 2) 
os (cosh.x)- — (smh x)- 
tanh (x) = ————.———_ 
cosh> x 
| 
= 5 by part (a). 
cosh” x 


Sr fae ier cosh~!x, then y > 0 and 


x =coshy=V1+ sinh? y by Problem 7(a). 


So 
sinh(cosh~! x) = sinh y=vx2-1. since sinh y > 0 for y > 0. 
(c) 
ie SP: I 
(sinh ")'(x) = eT Rs PS 
sinh (sinh “(.x)) 
| 


cosh(sinh~ | (x )) 


= ———__ by part (b). 


V1+x? 
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(tanh~')(x) = ————_—., 
‘ihe tanh’(tanh~ !(x)) 


oe cosh?(tanh~!(x)). 


Now, 
tanh? y + 5— = 1 by Problem 8(b), 
cosh* y 
sO 
tanh?(tanh~!(x)) + —.—_—__—— = |]. 
cosh?(tanh~!(x)) 
or 
cosh? (tanh! (x)) = 
|] —x? 
10. (a) If y=sinh™' x, then 
5 @ — e 7 
x=snhy = 5 
so 
e”'—e 7 = 2x, 
e?” —2xe’ -1=0, 
Be 2x + V4x244 
7 2 

so 

e =x+V1+x? since e’ > 0 
or 

y =sinh 'x =log(x + ¥14 x2). 
Similarly, 


cosh”! x = log(x + Vx? — 1), 


tanh! x = 5 log( +x)- 5 log(1 —x). 
(b) 


b 
/ dx =sinh-'b—sinh~'a_ by Problem 9c) 


v1+x? 
= log(b + V1 +b?) —log(a+ V1 +a’). 
[ 1 ict ie ae a,b> I 
a V¥x2-] —log(—b + Vb? —1)+log(—a+vVa?—1) a,b<-l 


b 
dé= sllogi + b) — log(1 — b) — log(1 +a) + log(1 — a)}. 


| — x2 


ZN): 


pile 


aj, 


(a) 


(d) 
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lim a* = lim e*'8¢, Since loga < 0, we have lim x loga = —oo, so 
BN PO xX 0O X— CO 
lim e* 84 = 0. 
GO) 
(lo x)" y 
lim 2 = lm — =0. 
X00 af yoo ey 


lim x* = lim e*!°8*, Now, lim x logx = 0 by part (d), so lim x* = I. 
x—0t x—>0t x—>0+ x>0t 


lim log (1 + yy = lor (1) —'1. 


lim x log(1 + 1/x) = lim log(1 + y)/y = 1. 
X—> CO w= ae 


e =exp(]) = exp(lim xlog(1 + 1/x)) 
(elm cxpcelos@ + 1/x)) 
ca OO 
= ml, 1 /x)’. 
xXx—0O 
(The starred equality depends on the continuity of exp at 1, and can 
be justified as follows. For every ¢ > 0 there is some 6 > O such that 
le — expy| < € for |y — 1| < 5. Moreover, there is some N such that 


|x log(1 + 1/x) — 1| < 6 for x > N. So |e — exp(x log(1 + 1/x))| < € for 
we N. 


= [him a Way) = tm Oa), 
slit @ + 1/x)** 
="lin( ayy) 
yoo 


After one year the number of dollars yielded by an initial investnent of one 


dollar will be 


(b) 


hte (Clee @/l00x) = Pie 


Clearly 4), =i sion = Ue livwe Ujtheney(x) = log(=x jase 
FO@)y=()el/(—xsSayx. 

We can write log | f| as go f where g(x) = log |x| 1s the function of part 
(a). So (log | |)’ = (g'° f) - f= I/f- f’. 

etme (x) = Foye] When 

aS fx) — f x )ee™ 


e2cx 


a4) = 


so there is some number & such that g(x) = & for all x. 

According to Problem 21, there is some k such that A(t) = ke’. Then 
k = ke = Ag. So A(t) = Ape”. 

If A(t +1) = A(t)/2, then 


Age“ 
7? 


Age t= 
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so ete? = &%/2or e&F = 5, so T = —(log2)/c. It is easy to check that 
this t does work. 
23. Newton’s law states that, for a certain (positive) number c, 


T'(t) =c(T — M), 
which can be written 
(T —M)' =c(T —M). 
So by Problem 21 there is some number k such that 
T(t) —M=ke", 


and k = ke? = T(0) —-M = Ty — M. So T(t) = M + (Tp — Me. 


CHAPTER 19 a Gece Ve V ae ca a 

_vx-1-vx4i 
Vrate yet ~2 ) 
(iii) (e* + e7* + e**)/e* =e + e-** +e. 
(iv) a*/b* = (a/b)* = eX @84/), 


(v) tan? x =sec?x — 1. 


; l 1 /a? 
ba a+x2 x 
| af 
m l | /a 
OO ———$——— 
va2—x2 VI] — (x/a)? 
_ l l — sinx | — sinx -< 
(vin), ——— = —\|_— = —— = sec“ x — sec x tan x. 
1] +sinx 1 —sin* x cOs- x 
? 
= 4 6 
(1x) abba e = 8x —2 + —— 
x+1 | 
| l 


0) —————— 


if 2x == x2 V1 =e — 12 


i) 


(i) —cose*. (Let u=e*.) 

(iii) (log x)?/2. (Let u = log x.) 

(v) ef. (Letu =e.) 

(vii) 2ev*. (Let u = Jx.) 

(ix) —(log(cos x))?/2. (Let u = log(cos x).) 


3. (i) [ve dx = x7e* — / Qre* dx = x7e* —2 [ve _ / e ax| 


Ce 
x“e* — 2xe* + 2e". 
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(ii) We have 


= esinbx  b ; 
i e*sin'bx dx: = ————_ — -— _| :e™ cos bx dx 


a a 
e*sinbx bj|e“cosbx  b on , 

a ey ee ree Se eEY 

a a a a 
sO 
Aes a = b 
e™ sin bx dx = +——e™ sin bx — ——e™ cos bx. 
a* +b a---b- 


(v) Using the result [ (log x)? dx = x(log.x)* — 2x (log x) + 2x from the text, 
we have 


[oe x) dx = [x(logx)? — 2x (log x) + 2x] log x 
l 
— i —[x(log x)? — 2x(logx) + 2x] dx 
x 
= x(logx)* — 2x (log x)? + 2x log x 
- [tog as + 2[xlogx —x]—2x 


= x(log x a 2x (log x yr 42x log x 
— [x(log x)? — 2x (logx) + 2x] +2[x log x — x] —2x 
= x(logx)*> — 3x(logx)* + 6x log x — 6x. 
(vii) 


9 : 
[ocx ax= [occ x)(secx) dx = tan x secx — [oans )(see x tan x) dx 


= tan x sec x — sec x(sec* x — 1) dx 


3 
= tan xsecx — / sec’ x dx + / sec x dx, 


[sc xdx = }/tanx secx + log(sec x + tan x)]. 


sO 


(ix) 
9 
2x3/2 2 3/9 l 
J/x log x dx = logx — = | x°'“--—dx 
f : 3 3 x 
2x7! 2 
= logx — = | x /“dx 
3 2 3 
i) 
2x3/2 | j 4 3/2 
= Og x — ox. 


i) Let x =sinu, dx = cosudu. The integral becomes 


cosudu : 
ldu =u = arcsinx. 


ay) =3 
v1 —sin“u 
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(iii) Let x = secu, dx = secutanudu. The integral becomes 


secutanudu 
—___—_—_———- = } secudu = log(secu + tan u) 
Vsec2u — 1 
= log(x + Vx? -1). 


(v) Let x =sinu, dx =cosudu. The integral becomes 
cosu du 
= | cscudu = —log(cscu + cotu) 


ms 59 
sinuy | — sin“ u 
2 
l v1l-—-<x- 


= —log{—+ 
x 


(vii) Let x = sinu, dx = cosudu. The integral becomes 


: 3 ag 2 : 2 2 
[vin ucosu)cosudu = [sn ucos- udu = [on u)(1 — cos* u) cos- udu 


cos? ul cos? u 


z 7 Ss 


= [isin u)(cos? u — cos? u)du = — 
d—2 , d—x*h2 


i ns 


. 9 a a . 
(ix) Let x =tanu, dx = sec~ udu. The integral becomes 


2 3 
/ secusec’ udu = / sec’ udu 


= 5 {tan usecu + log(secu +tanu)| by Problem 3(vii) 


= S[xv1 4+ x24 log(x + V¥ 1+ x )|. 


5. (i) Letu=V¥x4+1,x =u? —1, dx =2udu. The integral becomes 


2udu —2 
= 2+ du 
/ l+u ft 1+ -) 


= 2u —2log(1+u) = 2V¥x+1-—2log + Vx+1). 


1/6 


(iii) Let uw =x!/, x =u°, dx = 6u du. The integral becomes 


6 Sd l 
/ oy eee r — 6 | (w —u+1-— ) du = 2u? — 3u* + 6u— 6log(u + 1) 
ur +u? u+] 


2f/x —33/x +69 — Glog(4/x +1). 


| 


(v) 
du 
| epee 5 (= 
du 
=; 2+u 
] 
“iS 


By 


(vn) Let w = 2*, x = (ogu)/(og 2), dx 


=o ft 
log 2 u(u + 1) 
=cof('+s-a 

log 2 “u eut+ti 


+ logu — 2 log(u 


becomes 


af 
log 2 


u>+1 
—§| du 
(u+1)u 


= bg 


=o ) 


(ix) 


| 


Now let u = sin y, du = cosydy, 


2 cos y sin y cos y 


i) 


II 


| 


l—smny 


Let u = tanx, x = arctan u, dx = du/(1 + ur). 


-3/ Te 


logi2 + u) — = log +u- Ee 5 


log(2 + tanx) — 


—2cosy+y— 
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The integral becomes 


u—2 
l+u? 


_ du 


Qu 
1+ Fie 


du 
——s 
1+ u- 


‘s)ise 


arctan u 


2 
=X. 


I 2 
To log(1 + tan* x) + 5 


du/(u log 2). 


) du 


The integral 


+1)] 


[2* + x log 2 — 2log(2* + 1)]. 


B) x . 
Let u = /x, x = u*, dx = 2u. The integral becomes 


V1l—u22udu 


l—u 


The integral becomes 


52 ; 
(1 — sin“ y)sin y — 


1 —siny 


2 fa + sin y)sin ydy 


2 f sin ydy+ [i —cos2ydy 


sin 2y 
D 


a 


= —2cosy + y —sin ycos y 


; ——s 
1 —u? + arcsin u — uy 1 —u2 


—2/1 —x +arcsin Vx — JxV l—x. 


me ae . 
The substitution u = V1 —x, x = 1 —u-, dx = —2u du leads to 


ies 


4 
2u-du 


V1 —u2 
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and the substitution u = sin y then leads to 


Ae) 
/ —2sin° y cosy dy 


l—cosy 


—2sin y — y —sinycosy 


= —2u —arcsinu —uy | — u2 
= —2/1—x —arcsin/ 1 —x—-—vV1- avx. 


These answers agree, since 
; IT , 
arcsin /x = > ~ aresin l-—x 


(check this by comparing their derivatives and their values for x = 0). 
6. In these problems / will denote the original integral. 


(1) : : 
| dx+ f — 5 dx 
x—] (x + 1)? 
2log(x — 1 P 
= £10Q\X — Jae an 
(iit) 
l 4 
t= | — part f as 
(x — 1)- (x + 1)3 
| 2 
~  @=-1D @+D? 
(v) b 4 
2x 
== J == ae —— dx 
sfanet+fae 
= 5 log(x? +1)+4arctan x. 
(vii) 


| 2x 
l= dx + | —~———— dx 
(x + 1) (x*-+x+1) 
l 2x | 
= ax+ f = ax- f =. 
x+1 x*+x+1 x-+x+1 


Now 
l | 
/ SLL eT dx = / ea Ea dx 
Xe 1 (x+ 5)? +3 


[ ———— «x 
*| 


4, (x4 ) | 4 


ve arctan (3, (x aE :)) 


J3 


—- 3 arctan (3, (x + +)) : 


wm wi) 
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sO 


I = log(x + 1) + log(x* +x + 1) — d arctan (3 (x + +)) 


2x + | | 
ri a if 7 Sy a dx = Pa, a ee GE aE dx 
(x-+x+1)- (x-+x+1)- 


/ 2x + | j al ] dx 
= oe OY i Se 
(x? +%x+4+1) 9 2 ° 


Now the substitution 


~ 


9 [3% 
u=+,(x+34). dx = >> du 
V3 2 2 
changes the second integral to 


16 V3 du 
9 2 (uz + 1)2° 


Using the reduction formula, this can be written 


8/3 u +5 du 
9 |2?+1) 2/) w2+1]’ 


So 


to 


] l J3(x +4) 4/3 ( ( 4 )) 
= = SS I EO > oe act | Se [2X + 54). 
xtetet] 4243x411) 9 V3 2 


14. The equation [ e* sinx dx = e* sinx —e* cos.x — { e* sin.x dx means that any 
function F with F’(x) = e* sinx can be written F(x) = e* sinx — e* cosx — 
G(x) where G is another function with G’(x) = e* sinx. Ofcourse, G = F+e 
for some number c, but it is not necessarily true that F = G. 


16. (a) 


: : P x 
/ arcsin x dx = / l - arcsin x dx = x arcsin x — / ————-. dx 
5 
+f: l—x- 
: / > 
x arcsin x + V1 —x-. 
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17. (a) 
sin?xcosx. 3 f ..4 
[ sn’ dx = — ETE Sf sin x dx 
<3 ; 
sin” x cos x ce sin x cos X 4 | fra 
x 4 4 2 9 poo 
be sin?xcosx 3sinxcosx 3 
a 4 8 8 


sin 2x +3/ | + cos4x 
TF 7 ax 


1 —cos2x \? | - cos? 2: 
[sintxar= | ( 2 *) iv= f (7-4) a 
ee 
a 
x 


4 

* sin 2x IL[x . sn4x 

a #5 \5+ 8 
3 


x  sin2dx  sin4x 


<p 


(b) It follows that these two answers are the same, since they have the same 
value for x = 0. 


Ze fa) 
sin” xax= [in x)(sin"! x) dx 
= —¢osx sin”! x + (n — 1) / cos x (sin”~* x) cos x dx 
= —cosx sin” |! x +(n—1) [on x — sin" x)dx, 
sO 
[ sn" xdx =— : cosx sin”! x + dl [sw x dx. 
n n 
(b) 


[ costxax = [icosxy(cos"' x)dx 
= sinx cos” !x+(n—1) y sin x(cos”~? x)sinx dx 
= sinx cos”! x+(n— 1) f (cos x — cos" x)dx, 
so 


l n—2 
cos" x ax= = 5 ain Xx cos"— Pres a COs x dx. 
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(c) 


j= a x? ax 
(x2 ele oat (x2 ae — 1 (x2 + 1)" 


dx x 
dx a 
(x2 an ])r-1 2 aq = n)(x2 aL Jjr-l 


dx 
2(1 —n)(x? + <= 
sO 
] x 2n — 3 ] 


Shee SE ee ren RN | ee 
G2 22h] ) Ge)! 2 | ep 


‘ . > i 
We can also use the substitution x = tanu, dx = sec“ udu, which 
changes the integral to 


9 
sec udu me) 
—= = cos2"~2 udu 
SEGa hu 


l 2h 3 5 
== cos-”—3 usin ut alle cos’ 4 udu 
2n —2 2n —2 
] l x 2n —3 dx 


ro= ‘ee No cece x 2n—2 5 (x24 1)"71 


| By +o dx 
~ 2-1) 24D"! 'n-2/ @4ep 


CHAPTER 20 1. @) P3o(xv) =et+ex tex? + (Se/3!)x?. 
eS (x — 2/2)? (x—2/2)4 (—1)"(x — 2/2)?" 
(ut) Pan a/2(x) = 1 Te! =402) .) Cn 
al 2 x= 1)? 
(v) Peace: ©O —*) ee ee 


ni} 
(vii) Pyg(x) =x+ Pan 
(eer eee ex? —-- + (ee 
2. If f is a polynomial function of ee n. then f+) = 0. It follows from 
Taylors hecorcniinaten,,, (x) = 0, so f(*) = Fae). 


Ge SPE 3) liens). 
Cie PONCE SB) 270 (x = 3)? -- O02 seal emer as) 


9 
(Sy) I “i ¢ 
ore) 2 GF+ DI i+)! (since Qua! < 107°“ for 2n +2219, orn = 9}. 
= ©) 
oF 7 : 20502 2 
(1) 2, OQL4 1! (sinc 72n42(In + 2)1 = Id foi 2n + yp ea 18, 


orn > 8), 
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CHAPTER 22 


CHAPTER 23 


13 2! 32. gat 
(v) 2 ay smce Fea < 107° forn+1> 14, orn = 13 


(1) Ga= a; + D;. 
(111) — ( = 1)a;. 


(v) co =| ite) 626; =a ya lore O, 
0) 


QQ) l—-af/(n+ 1) =1/@4+1) <e forn+1 > I/e. 


(ii) lim Vn2+1— Vn +1 = lim (Vn? + 1- Vn2)+4 lim (Yn- /n +1) 
R—- OO A> Oo n> Ov 
=0Q+0=0. (Each of these two limits can be proved m the same way 


that lim (/n + 1 — /n) = 0 was proved in the text.) 
H—-> OO 


(v) Clearly lim (loga)/n =0. So lim “a = lim e@°8®/" = ° (by Theo- 
n> OX 


iu OO i> GX 
relpal) = Ik 


(vii) Vn2 < Vn24+n < V2n2, so (Yn)? < Vn24n < V2(2/n)?, and 


lim ( Vn)? = lim V2(4¢/n)? = | by parts (v) and (vi). 


Hi— Oo 


(ix) Clearly a(7) < log, n, and Inn (log, n)/n = 0. 


: 
(a) IPO eeinena = 2¢ <4 soa < V2q < 2. 


(b) Part (a) shows that 


ef 8 = VIP oo 2 


so the sequence converges by Theorem 2. 
(c) If this sequence is denoted by {a,}, then the sequence { / 2a, | is the 
same as {41}. 90 tlie hint shows that? = V2/, ort =2. 
If x is rational, then na!zx is a multiple of wz for sufficiently large n, so 
s ay . . SHE . 
(cosn!ax)** = | for all such n, so lim (lim (cosn!ax)**) = 1. If x is 
noo k-00 
irrational, then n! zx 1s not a multiple of a for any n, so | cosn!ax| < 1, so 
. BE 
lim (cosn!ax)-* = 0, so f(x) = 0. 
kKou 


I 
(i) iL e* dx =e-— |. (Use partitions of [0,1] into 7 equal parts.) 
0 


al 
(111) i Ae log 2. 
0 


M9) i | 9 Ee. | 
Vv ——idx=-. 
0 (+x)2. Z 


. F , 6 4 4 

(1) | (Absolutely) convergent, since |(sin n@)/n-| < I/n-. 

(i) Divergent, since the first 27 terms have sum eee - tale Weibmizss 
‘Theorem does not apply since the terms are not decreasing in absolute 


value.) 
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(v) Divergent, since 


(vn) Convergent, since 


(n+ 1)?/(n + 1)! ; ea | 
1 ——.————_ = lim . 
noo n-/n! n— oo 

(ix) Divergent, since 1/(logn) > 1/n. 

re : és 

(xi) Convergent, since 1/(logn)" < ae forn > 9. 


(xii) Divergent, since 


for large enough n. 


(xv) Divergent, since 
N ] 
ik —— dx = log(log N) — log(log 2) + oo as N — oo. 
2 xlogx 


(Notice that f(x) = 1/(* log x) is decreasing on [2, 00), since 


—|1 + log x] . 


5 0) for x > 1. 
(x log x)- 


f(x) = 
(xvii) Convergent, since 1/n?(logn) < 1/n? for n > 2. 
(xix) Convergent, since 
Hin + Din + 1"! _  2(n+1)n" 


lim —_ I ow“e itis tim am 
noo 2"n!/n" noo (N+ Lye) 


~~ 
to 


by Problem 18-17. 


(a) For each N we clearly have 


N co 
0 < De angle < 9)" 10-" = 1, 


n=1| n=] 


CO 
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sO ) a, 10™" converges by the boundedness criterion, and lies between 


n=| 


0 and 1. (Actually, this number is denoted by 0.a;aza3a4.... only when 


the sequence {a,} is not eventually 0.) 
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CHAPTER 24 


The area of the shaded region is 5. The integral is 


atl 31+ La] + Lig = a3) #3 Ulg — al + Lg — te] #2) 
SEG Tete at 1G + tebe at) 
st tit tads-j-ta+tet eae.) 
=s(\+7+ptpt ) 

8 4 “2 4 
ae 
i 

$i 

==. 


. . 0. 2=0 

Fla) = tia) | l, O<ez1 
{fn} does not converge uniformly to f. 

(im) f(x)= lm f(x) = 0 (since lim. x” = oo for x > 1). The sequence 
{fn} doesnot converge Bre Ree f; im fact, for any n we have f,(x) 
large for sufficiently large x. 

(v) f(x) = lim f,(x) = 0, and {f,,} converges uniformly to f, since 
| fn(x)| = Ti for all x. 


; 
] a x* 


= ir ter aaa 

Soe — 
(iii) eu | col 

k=0 

= 

6 -1( | 
k 2k+1 
(v) = eS pe. ee 

—- 2k +1 
Gi .e=* 
(ii) If 

ee x? 3 x4 eo 
[WO= 5 3.90 + 3 my x|< 
then 
2 3 
f'(a)=x- a ee 


= log(1 + x) |x| < 1, 


so for |x| < 1 we have f(x) = (1 + x) log(1 + x) — (1+ x) + ¢ for 
some number c. Since f(0) = 0, we have c = 1, so f(x) = (14+ x): 
log(] +x) —-x for |x| < 1. 


CHAPTER 25 


Since 
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oe = 1)"x 2n+} 


ae aa 
2) { 
= (2n + 1)! 


we have 


0° (—1y'x2" 
fe) = 2 (2n + 1)! 


(notice that the right side is | for x = 0). So 


ely 
f°? O) =% HLT’ 
0, k odd. 


cn 


(3 + 4e| = 5: = arctan . 

(1 +7) | = (1 +i) = (/2)°: since 2/4 = arctan 1/1 is an argument 
for 1 +7, an argument for (1 a i)? is 5/4. 

[(13 + 47|)| = [5] = 5; 6 =0. 


-i+t/-1-4 
= 2 
_aitv5i 


a 
_(ltv5)i (1 V5)i 
5 or a 
x7 + 27x —1 =(x +i)?, so the only solution is x = —i. 


x? = xG— x — %— "= Des a | I) Whe coltitions are 


t 3. foes. 
- = i. 


All z = iy with y real. 

All z on the perpendicular bisector of the line segment between a and b. 
For s = x +iy, we need Vx? + y? < 1 —x. This requires that 1 — x > 
0, and then our inequality is equivalent to x* + y? < (1 — x)?, or 
x < (1 — y*)/2 (and conversely this inequality implies that x < 5, so 
that 1 ~x > 0 holds). The set of points x +iy with x = (1 — ye is the 
parabola pointing along the second axis, with the point 5 + Oi closest 
to the origin, and which passes through the points 0 +7 and 0 — i; the 
desired set of complex numbers is the set of points inside this parabola. 


[et iy|? =x? + y? =x? + (—y)* = |v — iy’. 

(Zz +2)/2 = [((« tiy) + & —iy)]/2 =x. 

(z — z)/2 = [( +iy) -—(@ —iy)]/2i = y. 

lz + wl? + |e — wl? = @ + w)Z + H) + @ — w)E— w) = 222 + QwH = 
2(\z/* + |w|?). Geometrically, this says that the sum of the squares of the 
diagonals of a parallelogram equal the sum of the squares of the sides. 


662 Answers to Selected Problems 


Lo, ©) 
amg! 1. (i) | Converges absolutely, since |(1 +i)"/n!| = (/2)"/n!, and S (v2 y"/n} 


n=! 
conve rees. 


(iii) Converges, but not absolutely, since the real terms form the series 


and the imaginary terms form the series 


(}-44$-44-+). 


(v) Diverges, since the real terms form the series 
log 3 log4 log5 — log7 log8  log9 
. 9 a 4 ‘ « 

gg Pa a ee 
2. (i) The limit 


le y(n+1)? n 

1. ——_——,— =. lim 

n> [z|" [ne noo \n+ 1 
is < | for |z| < 1, but > 1 for |z| > 1. 

a) ‘The limit 

sjnt+l 


Pan 4 
lim = [z 
eae \z| 


is < | for |z| < 1 but > | for |z| > 1. 
(v) ‘The limit 


gnth (z+)! 


lim = fim 2)2(@rD!-0! 
n> oo 2"|\z|"" aes Iz| 


is 0 for |z| < 1, but oo for [z| > 1. 
3. (i) The limits 


\z|2" \z|2"+! 


= fel and lim ° 


3n 7 J3 n= 


are < I for |z| < V2, so the series converges absolutely for |z| < V2. 


- 
7 
~ 


. 2n 
lim 
nx 


n+ 1 


ol 


But the series does not converge absolutely for |z| > V2, so the radius 
of convergence is V2. 
(i) Since 


wos WEE gs =e ; 
— i hies) Mt =e by Problem 22-I(vi), 


im 21! lz} 
im 
n— Oo Qn n— oo 


- 


the radius of convergence is 2. 


(v) ‘The limit 
lim ¥/2"z"! =2 lim 2)! 


no nN 


is O for |z| < 1, but oe for |z| > 1, so the radius of convergence is 1. 


GLOSSARY 
OF SYMBOLS 


Pog 
la| 1] 
es he 


max(x,y) 16 
min(x,y) 16 
€ (“epsilon”) 18 
Neel 

G 23 

mi 623 

yia 24 

i=) 

25 

Q 25 
R25, 589 


Oe 


f(x) 40, 47, 601 
I 43 

ft+e 43,245 
ANB 43 
Jeg a8 

ie) 43 

c-g 43 
|Cceegeeseay mee 
fa eae, z} 644 
f+e1n0 44 
f-a:h 44 
fog 44 
fogoh 45 
x—> f(x) 45 
a” 49 


[ [@ 49 

i=! 

Ca 50 

AUB 50 
R-A_ 50 

lial ees! 
Imax(f,2) | Sil 
min(f,g) 51 
face 353 

the pair (a,b) 54 


the open interval (a, b) 


56 
Ia, b) 57 
(,00) 3/ 
la, co) 57 
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(—coo,a) 57 
(—co,a] 57 
(—oo,00) 57 
[| pe fee 
(Palme? (2 
v+w 75 
vew 78 
lvl] 78 
det(v,w) 79 
6 (“delta”) 98 


Him fey 101 
lim f 101 
jim, f(x) 106 
lim f(x) 106 
lim f(x) 106 
ne f GO) B06 
Him Fe) 106 


lim f(x) 113 


lum f (x) ="eommalils 


lim f(x): sag es 


supA 134 
lubA 134 
infA 134 
glbA 134 
limA 143 
limsup A 143 
limA_ 143 
f(a) 15) 
ja Misi 
df (x) 
dx 
df (x) 
dx x=a 
ft" ‘61 
fT" Jel 
Fa 161 
d? f (x) 


dx? 


fae o231 


154 


155 


162 


e 244, 331 
c+td 245 
a-c 245 
ced 246 
det(c,d) 246 
c 246 
R(f.a.b) 253 
Leh yar 254 
UCGgery 254 


[s 258 


b 
[ seoax 264 
COP) ele 


L(x) 278 
b 

it : f 295 
b 

U i f 295 


i: 301 


i f G@idx © 301 


sin 304 

sin’ 304 

xz 305 

A(x) 306 

cos 306, 308, 563 
smn 306, 308, 563 
sec 310 

tan 310 

csc 310 

cot 310 

arcsin 310 
arecos 311 

arctan) 311 

e 331 

log 341 

exp 343, 563 

e 343 
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e* 344 

a’ 345 

log, 346 
sinh 353 
cosh 353 
tanh 353 
arg cosh 353 
arg sinh 353 
arg tanh 353 
Nap log 358 


b 

F(x) 363 
If 364 
J f(x)dx 364 
Tix) 394 
£,., (ANZ 
Pan f 4 l 2 
Rig 42) 
[P|, 434 
is) 437 

n 
fa,} 452 


lima, 453 


Nn XO 


lima, =0co 456 
noo 


y 463 
lim x, 467 


noo 


limsupx, 467 


n> OO 


lim x, 468 


n-- Oo 
liminf x, 468 
n-> XX 


Ni(n:a.b) 469 


x 


yo a, 472 


i=1 

i °526;-532 
Cc 33 
z 534 

\z| 534 

Re 541 

Im 541 

6 542 

lim f(z) 542 


f(a) 551 


sin 563 
cos 563 
exp 563 
bs S72 

B; 5/2 

Di. STS 

DF S75 
e- 573 

A 573 

G, “DIS 

wv, 576 

+ 581,591 
« 581,594 
0 581,591 


1 382,596 
—a 582,592 
a! 582, 596 


P 583 

> 584,590 
< 584, 590 
> 584, 590 
s 584, 590 
Je] 595 


AalbmcndoE, 276 
Abel, Niels Henrik, 410, 522 
Abel summable, 522 
Abel’s formula for summation by parts, 
392 
Abel’s Lemma, 393 
Abel’s test, 496 
Abel’s Theorem, 522 
Absolute value, 11 
of a complex number, 534 
Absolutely convergent, 480, 556 
Absolutely summable, 480 
Acceleration, 161 
Acta Eruditorum, 148 
Addition, 3 
associative law for, 9 
commutative law for, 9 
of complex numbers, 531 
geometric interpretation of, 535 
of vector-valued functions, 245 
of vectors, 75 
Addition formula 
for-arcsin, O17 
fOrarctans 37 
for cos, 314 
for sin, 313, 314 
for tan, 317 
Additive identity 
existence of, 9 
for vectors, 76 
Additive inverses 
existenGe.o,, 9 
Algebra, Fundamental Theorem of, 
5371, Dobe 240 g509 
Algebraic functions, 363 
Algebraic number, 442 
Algebraist’s real numbers, 598 
Almost lower bound, 142 
Almost upper bound, 142 
Analyst’s real numbers, 598 
Angle, 303 
directed, 303 
Antidiagonal, 243 
Arabic numerals, multiplication of, 8 
Arc length, 278, 283 
Arccos, 311 
derivative of, 311 
Archimedes, 138, 141, 263 
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Archimedian property 
for the rational numbers, 584 
for the real numbers, 138 
Archimedian spiral, 85, 249 
Arcsec, 320, 383 
Arcsin, 310 
addition formula for, 317 
derivative of, 311 
Taylor series for, 517 
Arctan, 311 
addition formula for, 317 
derivative of, 311 
Taylor polynomials for, 413, 420 
remainder term for, 426 
Area, 253, 258 
Arg cosh, 353 
Arg sinh, 353 
Arg tanh, 353 
Argument, 536 
Argument function, 542 
discontinuities of, 546 
Argument of the hyperbolic 
functions, 353 
Arithmetic mean, 33 
Arrow, 75, 76 
“x arrow sin(x7)”, 45 
Associative law 
for addition, 9 
of vectors, 76 
for multiplication, 9 
Average velocity, 152 
Axis 
horizontal, 57 
imaginary, 534 
real. fo 9 
vertical, 57 


Bacon, Francis, vi 

Basic properties of numbers, 3 

“Bent graphs”, 149 

Bernoulli, Jakob, 148, 574 

Bernoulli numbers, 572 

Bernoulli polynomials, 575 

Bernoulli’s inequality, 32 

Big game hunting, mathematical 
theory of, 552 

Binary operation, 581 
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Binomial coefficient, 27, 437 
Binomial series, 495, 518 
Binomial theorem, 28 
Bisection argument, 142, 552 
Bohr, Harold, 394 
Bolzano-Weierstrass ‘Theorem, 458, 469, 
do2 

Bound 

almost lower, 142 

almost upper, 142 

greatest lower, 134 

least upper, 133, 584 

lower, 134 

upper, 133 
Boundedness criterion, 474 
Bounded above, 122, 133, 457, 584 
Bounded below, 134, 457 
Bourbaki, Nicholas, 148 


Cantor, Georg, 448 
Cardioid, 89, 250 
Cartesian coordinates, 84 
Cauchy 
Condensation Theorem, 496 
criterion, 473 
form of the remainder, 437 
Mean Value Theorem, 204 
product, 493, 513 
sequence, 459, 571 
equivalence of, 599 
Cauchy-Hadamard formula, 569 
Cauchy-Schwarz inequality, 281 
Cavalieri, Bonaventura, 275 
Cesaro summable, 493 
Chain Rule, 174 ff 
proof of, 178 
Change, rate of, 152 
Characteristic (of a field), 586 
Circle, 65 
“f circle g”, 44 
unit, 66 
Circle of convergence, 560 
Classical notation 
for derivatives, 154-156, 162, 167, 
187; 24)=242 
for nitegrals, 265 
Cleio, 186 
Closed interval, 57 
Closed rectangle, 547 


Closure under addition, 9 
Closure under multiplication, 9 
Commutative law 
for addition, 9 
of vectors, 76 
for multiplication, 9 
Comparison test, 474 
limit, 475 
Comparison ‘Theorem, Sturm, 323 
Complete induction, 23 
Complete ordered field, 584, 603 
Completing the square, 17, 379 
Complex analysis, 565 
Complex function 
continuous, 545, 546 
differentiable, 550 
graph of, 542 
limit of, 542 
nondifferentiable, 551 
‘Taylor series for, 563 
Complex nth root, 536 
Complex numbers, 526, 531 
absolute value of, 534 
addition of, 531 
geometric interpretation of, 535 
geometric interpretation of, 533° 534 
imaginary part of, 531 
infinite sequence of, 555 
infinite series of, 555 557 
logarithm of, 570 
modulus of, 534 
multiplication of, 531 
geometric interpretation 
of, 535-536 
real partof, 531 
Complex plane, 533 
Complex power series, 557 
circle of convergence of, 560 
radius of convergence of, 559 
Complex-valued functions, 541 
Composition of functions, +4 
Concave function, 220 
Conditionally convergent series, 481 
Cone, 80 
generating line of, 80 
surface area of, 404 
Conic sections, 80; see also Ellipse, 
Hyperbola, Parabola 
Conjugate, 534, 539 


Conjugate function, 541 
Constant function, 43 
Construction of the real numbers, 
588 ff. 

Continued fraction, 462 
Continuous, uniformly, 144 
Continuous at a, 115,545 
Continuous function, 115, 118, 546 

nowhere differentiable, 159, 509 
Jontinuous on (a,b), 118 
Continuous on [a,b], 118 
Contraction, 466 
Contraction lemma, 466 
Converge 

poimtwise, 502 

uniformly, 502, 506 
Convergent sequence, 453, 555 
Convergent series, 472, 556 

absolutely, 480, 556 

conditionally, 481 
Convex function, 219 

strictly, 228 

weakly, 228 


Convex subset of the plane, 229, 553 


Cooling, Newton's law of, 356 
Coordinate 
first." 7 
second, 57 
Coordinate system, 57 
cartesian, 84 
origin of, 57 
Coordinates 
polar, 84 
“Corner, 60 
Cos, 303-304, 306, 321-322, 563 
addition formula for, 314 
derivative of, 172, 307 
inverse of, see Arccos 
‘Taylor polynomials for, +13 
remainder term for, 424 
Cosh, 358 
Cosine, hyperbolic, 353 
Comes 
derivative of, 310 
Countable, 449 
Counting numbers, 21 
Critical point, 190 
Critical value, 190 
Csé, Fl0 


Index 


derivative of, 310 
Cubic equation, general solution, 
528 529 


Curve 


671 


parameterized, tangent line of, 246 


parametric representation of, 244 
reparameterization of, 247 
Cycloid, 250 


Darboux’s Theorem, 214 

De Moivre’s Theorem, 536 
Decimal expansion, 73, 492, 599 
Decreasing function, 195 
Decreasing sequence, 457 
Dedekind, Richard, 38 
Defined implicitly, 241 

Definite integral, 365 
DEFINITION, 47 

Definition, recursive, 23 
Degree (of a polynomial), 42 
Degree measurement, 304-305 
Delicate ratio test, 493 
Delicate root test, 493 

Dense, 140 

Derivative, 149 ff, 151 


classical notation for, 154-156, 162, 


167, 187, 241-242 
higher-order, 16] 
“infinite”, 158 
left-hand, 156 


Leibnizian notation for, see Derivative, 


classical notation for 

logarithmic, 351 
“negative infinity’, 158 
ot fx VSil 
of f at apo 
of vector-valued function, 246 
right-hand, 156 
Schwarzian, 184 
second, 161 

Schwarz, 439 


Derivative of quotient, mcantation for, 


171 
Descartes, René, 84 
Determinant, 79 
of vector-valued functions, 248 
Diagonal, 233 
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Difference operator, 573 
Diflerentiable, 151, 550 
Differential equation, 292, 300, 321, 


322-323, 356, 361, 438-440 
initial conditions for, 440 


Differentiation, 168 ff. 


implicit, 24] 
logarithmic, 35] 


Differentiation operator, 573 
Dini’s ‘Theorem, 524 
Directed angle, 303 
Dirichlet’s test, 495 

Disc method, 402 


Discontinuities of a nondecreasing 


functton, 450 


Discontinuity, removable, 121 
Disraeli, Benjamin, 2 


Distance, 58, 534 


shortest between two points, 278 


Distributive law, 9 
Diverge, 453, 556 
Division, 6 

Division by zero, 6 
Domain, 40, 41, 47, 601 
Dot product 


of vectors, 78 
of vector-valued functions, 246 


Double intersection, 165 
Double root, 185 
Durége, 38 


343 

irrationality of, 429 
relation with 2, 448, 564 
transcendentality of, 444 
value of, 344, 426 


Eccentricity of ellipse, 87 
Elementary function, 363 


Ellipse, 66, 82 


axes of, 57 

eccentricity of, 87 

equation in polar coordinates, 86-87 
focus pot of, 66, 86 

major axis of, 87 

mimor axis of, 87 


Ellipsoid of revolution, 405 
empty collection, 23 


Entire function, 567 
Epsilon, 18 
Equal up to order 2, 418 
Equality, order of, 418 
Equations, differential, see Differential 
equations 
Equivalent Cauchy sequences, 599 
Etymology lesson, 82 
Euler, Leonhard, 575 
Euler’s number, 463 
Euler-Maclaurin Summation Formula, 
576 
Even function, 51, 199 
Even number, 25 
Eventually mside, 555 
Exhaustion, method of, 141 
Exp, 343 ff, 563 
classical approach to, 357 
elementary definition of, 468 
Taylor polynomials for, 413 
remainder term for, 426 
Expansion, decimal, 73, 492, 599 
Extension of a function, 115-116 


Factorial, 23 
Factorials, table of, 432 
Factorization into primes, 


(U3) 
= 


Fibonacci, 32 
Fibonacci Association, 32 
Fibonacet: Quarterly, The, 32 
Fibonacci sequeneey 329215 372 
Field, 581 
characteristic of, 586 
complete ordered, 584, 603 
ordered, 583 


First coordinate, 57 


First Fundamental Theorem of Calculus, 


285 

Fixed pomt of a function, 465 
Focus point, 66, 86 
Force, as vector, 76 
Four leaf clover, 88 
Fourier series, 318, 320, 323 
Fraction, continued, 462 
Function, 39, 47 

absolute value, 541 


Function (continued) 
argument, 542 
discontinuities of, 546 
complex valued, 541 
composition of, 44 
concave, 220 
conjugate, 541 
constant, 43 
continuous, 115 ff. 
convex, 219 
critical point of, 190 
critical value of, 190 
decreasing, 195 
derivative of, 149 ff. 
differentiable, 151, 550 
elementary, 363 
entire, 567 
even, 51, 199 
exponential, 343-344 
extension of, 115-116 
fixed point of, 465 
from A to B, 601 


from real numbers to the plane, 244 


graphs of, 57-65, 198, 542 
hyperbolic, 353 

identity, 43 

imaginary part, 541 
implicitly defined, 241 
increasing, 195 

integrable, 258 

integral of, 258 

mverses, 2.510 ih, 

linear, 58 

local maximum point of, 189 
local minimum point of, 189 
local strict maximum point of, 218 
logarithm, 341, 346 
maximum point of, 188 
maximum value of, 188 
minimum value of, 188 

most general definition of, 601 
negative part of, 51 
nondecreasing, 243 
nonincreasing, 243 
nonnegative, 51 

notation for, 40, 45 

odd, 51, 199 

one-one, 230 


periodic, 71, 164, 298 


Index 


polynomial, 42 

positive part of, 51 

power, 60 

product of, 43 

quotient of, 43 

rational, 42 

real part, 541 

real-valued, 541 
“reasonable”, 68, 118, 149, 180 
regulated, 524 

square root, 546-547 

Step, 278 

strict maximum point of, 218 
sum of, 43 

trigonometric, 303 ff. 

value at x, 40 

vector-valued, 244 
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Fundamental Theorem of Algebra, 377, 


338, 549, 567 
Fundamental ‘Theorem of Calculus 
First, 285 
Second, 289 


Gabriel, 408 

Galileo Galilei, 148, 164 
Gamma function, 394, 444 
Generating line, of a cone, 80 
Geometric mean, 33 
Geometric series, 473 

Global property, 123 


~ : Dae 
Goes to, “x goes to sin(x*)”, 45 


Graph of polynomial function, 197—198 


Graph sketching, 196-201 


Graphs, 5765765 die) Olea eal Ss, 


542 

Gravitation, 330 
Greatest lower bound, 134 
Grin and bear it, 385-386 
Gronwall’s mequality, 356 
Grow 

at the.same rate as, 361 

faster than, 361 
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Hadamard, see Cauchy-Hadamard 
formula 
Half-life (of radioactive substance), 356 
Hermite, Charles, 443 
High-school student’s real numbers, 599 
Higher-order derivatives, 16] 
Hilbert, David, 443 
Horizontal axis, 4 
Hyperbola, 67, 82 
equation in polar coordinates, 88 
Hyperbolic cosine, 353 
Hyperbolic functions, 353 
Hyperbolic sine, 353 
Hyperbolic spiral, 316 
Hyperbolic tangent, 353 


Identity 
additive, 9 
multiplicative, 9 
Identity function, 43 
Identity operator, 574 
Imaginary axis, 534 
Imaginary part function, 541 
Imaginary part of a complex number, 
53! 
Implicit differentiation, 241 
Implicitly defined, 24] 
Improper integral, 301-302, 397-399 
Incantation for derivative of quotient, 
171 
Increasing at aye21@ 
Increasing function, 195 
Increasing sequence, 457 
Indefinite integral, 365 
Indefinite integrals, short table of, 
365-366 
Induction, mathematical, 21 
complete, 23 
Inductive set of real numbers, 34 
Inequalities, 9 
in an ordered field, 584 
Inequality 
Bernoulli's, 32 
Cauchy-Schwarz, 281 
ecometric-arithmeuc mean, 33 
Gronwall’s, 356 


Jensen’s, 228 


Liouville’s, 448 
Schwarz ldeed3e2 281 
triangle, 71 
Young’s, 276 
Infimum, 134 
“Infinite” derivative, 158 
Infinite intervals, 57 
Infinite product, 329, 395, 497 
Infinite sequence, 452, 555 
Infinite series, 471 
multiplication of, 486 
Infinite sum, 430, 471 
Inhnite trumpet, 408 
Infinitely many primes, 32 
“Infinitely small”, 155, 264 
Infinity, 57 
minus, 57 
Inflection point, 225 
Initial conditions for differential equa- 
tions, 440 
Instantaneous speed, 152 
Instantaneous velocity, 152 
Integer, 25 
Integrable, 258 
Integral, 258 
classical notation for, 264-265 
definite, 365 
improper, 301-302, 397-399 
indefinite, 365 
short table of, 365 366 
Leibnizian notation for, see Integral, 
classical notation for 
lower, 295 
Mean Value Theorem for, 277 
Second Mean Value Theorem for, 
39] 
upper, 295 
Integral form of the remainder, 423 
Integral sign, 258 
Integral test, 478 
Integration 
by parts, 366 ff. 
by substitution, 369 II. 
limits of, 258 
of rational functions, 377 ff. 


Interest (finanee), 355 


Intermediate Value Theorem, 124, 131, 
135, 299 
Interpolation, Lagrange, 49 
Intersection of sets, 43 
Interval, 56 
closed, 57 
infinite, 57 
open, 56; see also Nested Intervals 
‘Theorem 
Inverse 
additive, 9 
multiplicative, 9 
Inverse of a function, 231 ff. 
Inverse square law, 330 
Inverses of trigonometric functions, 
see ‘Trigonometric functions 
Irrational numbers, 25 
Isomorphic fields, 602 
Isomorphism, 602 


Jensen’s mequality, 228 
Johnson, Samuel, 607 


Jump, 60 


Kepler, Johannes, 330 
Kepler’s laws of planetary motion, 330 


Lagrange form of the remainder, 423, 
436 

Lagrange interpolation formula, 49 

Large negative, 64 

Least upper bound, 133 ff, 584 

Least upper bound property, 135 

Lebesgue, see Riemann-Lebesgue 
Lemma 

Left-hand derivative, 156 

Leibniz, Gottfried Wilhelm, 155, 264 

Leibniz’s formula, 184 

Leibniz’s Theorem, 481 

Leibnizian notation for derivatives, 154- 
156, 167, 187, 241 

for higher order derivatives, 162 
Lemma, 102 
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Lemniscate, 89 
Length, 278, 283 
L’Hopital, Marquis de, 148 
L’Hopital’s Rule, 204, 213-214 
Limit, 90 ff, 98, 542 
at infinity, 106 
“does not exist”, 101 
from above, 106 
from below, 106 
of a sequence, 453 
of vector-valued function, 246, 252 
uniqueness of, LOO 
Limit comparison test, 475 
Limit of integration, 258 
Limit point, 469, 552 
Limit superior, 143, 467 
Lindemann, Ferdinand von, 447 
Line, real, 56 
Line, tangent, see ‘Tangent line 
Linear functions, 58 
Liouville, Joseph, 448 
Liouville’s inequality, 448 
Liouville’s Theorem, 567 
Lipschitz of order a, 210 
Local maximum poit of function, 189 
higher-order derivative test for, 417 
second derivative test for, 201 
Local mmimum point of function, 189; 
see also Local maximum point 


Local property, 109, 123, 166 
Local strict maximum point, 218 
Log, 341, 346 

Taylor polynomials for, 413 

remaimder term for, 427 

Logarithm 

classical approach to, 357 

Napierian, 358 

of a complex number, 570 

to the base 10, 339 
Logarithmic derivative, 351 
Lower bound, 134 

almost, 142 

greatest, 134 


Lower integral, 295 
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Lower limit of integration, 258 
Lower sum, 254 
Lowest terms, 73 


Maclaurin, see Kuler-Maclaurin summa- 


tion formula 

Major axis of ellipse, 87 
Mass, rate of change of, 152 
Mathematical induction, 21 
Maximum of two numbers, 16 
Maximum point of a function, 188 

local, 189; see also Local maximum 

point 

local strict, 218 

strict, 218 
Maximum value of function, 188 
Mean 

arithmetic, 33 

geometric, 33 
Mean Value Theorem, 193, 194 

Cauchy, 204 

for integrals, 277 

Second, 391 

Method of exhaustion, 141 
Minimum of function, 188 
Minimum of two numbers, 16 
Minimum point of a function, local, 


189; see also Local minimum point 


Minor axis of ellipse, 87 
Minus infinity, 57 
Afinfici logarithmonum canonis description, 
358 

Modulus of a complex number, 534 
Mollerup, Johannes, 394 
Multipheation, 5 

associative law for, 9 

closure under, 9 

commutative law for, 9 

of arabic numerals , 8 

of complex numbers, 531 


geometric interpretation, 535 536 


of function and vector-valued func- 
tion, 245 
of infinite series, 486 
of number and vector, 77 
of vectors , 77 
Multiplicative identity, existence of, 9 


Multiplicative inverses, existence of, 9 
Muluplicity (of a root), 130, [85-186 


Napier, John, 358 
Napierian logarithm, 358 
Natural numbers, 21, 34 
Negative, large, 64 
“Negative infinity”, derivative, 158 
Negative number, 9 

Negative numbers, product of two, 7 
Negative part of a function, 51 
Nested Interval ‘Theorem, 142 
Newton, Isaac, 155, 276, 330 
Newton’s law of cooling, 356 


Newton’s laws of motion, 161 
Newton’s method, 464 
Nondecreasing function, 243 
Nondecreasing sequence, 457 
Nondifferentiable complex functions, 
551 
Nonincreasing function, 243 
Nonincreasing sequence, 457 
Nonnegative function, 51 
Nonnegative sequence, 474 
Norm, 78, 252 
Notational nonsense, 573 


Nowhere differentiable continuous 
function, 509 
nth root, 71, 536 
existence of, 125, 536, 553 
primitive, 540 
Null’set, 23 
Number 
algebraic, 442 
complex, 526, 531 
counting, 21] 
even, 25 
imaginary, 526 
irrational, 25 
natural, 21, 34 
odd, 25 
prime, 31 
rational, 25 
réal, 25, 534, 589 
transcendental, 442 
Numbers, basic properties of, 3 


Odd function, 51, 199 
Odd number, 25 
One-one function, 230 
Open interval, 56 
Bee 
Order of equality, 418 
Ordered field, 583 
complete, 584 
Ordered pair, 47 (footnote), 54 
Origin (of a coordinate system), 57 


Pair, 46 
ordered, 47 (footnote), 54 
Parabola, 60, 82 
area under, 263 
equation in polar coordinates, 88 
Parallelogram, 76 
Parameterized curve, tangent line of, 
246 
Parametric representation of a curve, 
244 
Partial fraction decomposition, 378 
Partial sums, 47] 
Partition, 254 
Parts 
Abel’s formula for summation by, 392 
integration by, 366 ff. 
Pascal’s triangle, 27 
wheak , 6! 
Peak point, 458 
Period of a function, 71, 164, 298 
Periodic function, 71, 164, 298 
Perpendicularity of lines, 70 
Petara, H., 552 
Pig, yellow, v, 375 
Pigheaded, 186 
Plane, 58 
complex, 533 
Planetary motion, Kepler’s laws of, 330 
Point, 56 
Point of contact, 220 
Point-slope form of equation of a line, 
59570 
Polar coordinates, 84 ff. 
Polynomial function, 42 
graph of, 61, 197 ff. 
multiplicity of roots, 130, 185-186 
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Polynomials, Bernoulli, 575 
Pope, Alexander, 330 
Position, rate of change of, 152 
Positive element of R, 593 
Positive elements of an ordered 
field, 583 
Positive number, 9 
Positive part of a function, 51 
Power functions, 60 
Power series, 510 
complex, 557 
centered at a, 510, 564 
Powers of 2, table of, 432 
Prime number, 31 
characteristic of a field, 586 
infinitely many of, 32 
unique factorization into, 31 
Primitive, 363 
Primitive nth root, 540 
Principia, 276 
Product, 5 
Cauchy, 493, 513 
infinite, 329, 395, 497 
of function and vector-valued func- 
tion, 245 
of functions, 43 
of number and vector, 77 
of two negative numbers, 7 
of vectors, 77 
Pyramid 
surface area of, 403 
volume of, 407 
Pythagorean theorem, 25, 58 
ma, 305 
Archimedes’ approximation of, 141 
irrationality of, 326 
relation to e, 448, 564 
transcendentality of, 447 
value of, 433 
Viéte’s product for 2/7, 329 
Wallis’ product for 7/2, 395 


Quaternions, 587 


Quotient of functions, 43 
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Rabbits 
growth of population, 32 
Radian measure, 63, 304 305 
Radioactive decay, 355-356 
Radius of convergence of complex 
power series, 559 
Rate of change of mass, 152 
Rate of change of position, 152 
Ratio test, 476 
delicate, 493 
Rational functions, 42 
integration of, 377 ff. 
Rational numbers, 25 
Real axis, 533 
Real hne, 56 
Real number (formal definition), 589 
Real numbers, 25 
algebraist’s, 598 
analyst’s, 598 
Archimedian property of, 138 
construction of, 588 ff. 
high-school student’s, 599 
inductive set of, 34 
Real part function, 541 
Real part of a complex number, 531 
Real-valued function, 541 
Rearrangement of a sequence, 483 
“Reasonable” function, 68, 118, 149, 
180 
Rectangle, closed, 547 
Recursive definition, 23 
Reduction formulas, 377 
Regulated function, 524 
Remainder term for ‘Taylor poly- 
nomials, 421, 423, 437 
Removable discontinuity, 121 
Reparameterization, 247 
Revolution 
ellipsoid of, 405 
solid of, 402 
Riemann sum, 282 
Riemann-Lebesgue Lemma, 320, 391 
Right-hand derivative, 156 
Rismg Sun Lemma, 143 
Rolle, Michel, 186 
Rolle’s ‘Vheorem, 193 
Root 
multiplicity of, 130, 185-186 


Root of a polynomial function, 50 
double, 185; see also nth roots 
Root test, 493 
delicate, 493 


Same sign, 12 
Scalar, 78 
Scalar product of vectors, 78 
Scuiwarz ie A.w 217 
Schwarz inequality, 17, 33, 281 
Schwarz second derivative, 438 
Schwarzian derivative, 184 
Sec, 310 
derivative of, 310 
inverse of, see Arcsec 
Secant line, 150 
Second coordmate, 57 
Second derivative, 161 
Schwarz, 438 
Second derivative test for maxima and 
minima, 201 
Second Fundamental Theorem of 
Calculus, 289 
Second Mean Value Theorem for 
Integrals, 391 
Sequence 
absolutely summable, 480 
Cauchy, 459 
complex, 571 
equivalence of, 599 
complex numbers, 555 
convergent, 453 
pointwise, 502 
uniformly, 502 
decreasing, 457 
divergent, 453, 556 
ibondcem 32, 02)5572 
increasing, 457 
infinite, 452, 555 
limit of, 453 
nondecreasing, 457 
nonincreasing, 457 
nonnegative, 474 
rearrangement of, 483 
summable, 472 
Series 
absolutely convergent, 480 
conditionally convergent, 481 
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Series (continued) Substitution 
convergent, 472, 556 integration by, 369 ff. 
Fourier, 318,320, 323 world’s sneakiest, 386 
geometric, 473 Substitution formula, 369 
power, 510, 557 Subtraction, 5 
Taylor, 511 sum 
met, 22 finite, 3-4 
empty, 23 infinite, 430, 471 
Sets lower, 254 
intersection of, 43 of an infinite sequence, 472 
notation for, 43-44 of an infinite sequence of complex 
Shadow point, 143 numbers, 555 
Shell method, 403 of functions, 43 
Sigma, 24 of vector-valued functions, 245 
Sign, 12 of vectors, 75 
Simpson’s rule, 401 partial, 471 
Sin, 43, 303-304, 306; 321-322, 563 sigma notation for, 24 
addition formula for, 313, 314 upper, 254 
derivative of, 172, 307 Sum of squares, 552 
inverse of, see Arcsin Summable, 472, 556 
Taylor polynomials for, 412 Abel, 522 
remainder term for, 424 absolutely, 480 
Sine, hyperbolic, 353 Cesaro, 493 
Sine function, 43 uniformly, 506 
Sih 353 Summation by parts, Abel’s formula for, 
Sketching graphs, 196 201 a7. 
Skew field, 587 Supremum, 134 
Slope of a straight line, 58 Surface area 
Solid of revolution, 402 of cone, 404 
Speed, instantaneous, 152 of pyramid, 403 
Spiral of solid of revolution, 404 
Archimedian, 85, 249 Swift, Jonathan, 580 
hyperbolic, 316 Symmetry in graphs, 199 


Square root, 12, 527 
existence of, 124 
Square root function, 546-547 


Square root in a field, 586 Van. 310 

Squaring the circle, 447 derivative of, 310 

Step function, 278 inverse of, see Arctan 

Strling’s Formula, 578 ‘Taylor series for, 573 

Straight line Tangent, hyperbolic, 353 
analytic definition, 58 Tangent line, 149, 151 
shortest distance between two points, of parameterized curve, 246 

278 point of contact of, 220 

slope of, 58 “Tangent line’, vertical, 158 

Strict maximum point, 218 Vanlie > 53 

Strictly convex, 228 Taylor polynomial, 412 ff. 

Sturm Comparison Theorem, 323 remainder term of, 421, 423, 427: 


Subsequence, 458 see also specific functions 
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Taylor series, 511 
‘Taylor's Theorem, 424 
Torus, 405 
Transcendental number, 442 
Trapezoid rule, 400 
‘Triangle inequality, 71 
Trichotomy law, 9 
Trigonometric functions, 303 {f., see also 
cos, cot, CSc, sec, sin, tan 
integration of, 376-377 
inverses of, 310 ff, see also arccos, 
arcsec, arcsin, arctan 
‘Trumpet 
infinite, 408 
Truncation of a polynomial, 434 
Two-time differentiable, 161 


Uniform limit, 502 
Uniformly continuous function, 144 
Uniformly convergent sequence, 502 
Uniformly convergent series, 506 
Uniformly distributed sequence, 469 
Uniformly summable, 506 
Uniqueness 

of factorization into primes, 31 

of limits, 100 
Unit circle, 66 
Upper bound, 133, 584 

almost, 142 

least, 133 
Upper integral, 295 
Upper limit of integration, 258 
Upper sum, 254 


“Valley”, 61 
Value 
absolute, see Absolute value 


Value of f at x, 40 


Vanishing condition, 473 
Vector-valued functions, 244 
Vector-valued functions 
determinant of, 246 
derivative of, 246 
dot product of, 246 
limit of, 246, 252 
multiplication of function by, 245 
sum of, 245 
Vectors, 75 
addition of, 75 
as forces, 76 
dot product of, 78 
multiplication by numbers, 77 
multiplication of, 77 
scalar product of, 78 
Velocity 
average, 152 
instantaneous, 152 
Vertical axis, 57 
Viete, Francois, 329 
Volume, 402-403 
of solid of revolution, 402 


Wallis’ product, 395 

Weakly convex, 228 

Weierstrass, see Bolzano-Weierstrass 
Theorem 

Weierstrass M-test, 507 

Well-ordering principle, 23 

Wright, Edward, 388 


Young’s inequality, 276 


Zan. 26 
Zero, division by, 6 
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